generated from fastai/nbdev_template
Issues: huggingface/trl
[Tracking issue] Integrate native liger-kernel losses
#2495
opened Dec 17, 2024 by
qgallouedec
Open
6
[Tracking issue] Wrong loss scaling when accumulating gradient
#2617
opened Jan 23, 2025 by
qgallouedec
Open
Issues list
Converting a conversational dataset into a standard dataset [not working]
🐛 bug
Something isn't working
#3490
opened May 23, 2025 by
nbasyl
5 tasks done
RuntimeError: NCCL error: invalid usage when only one GPU
⚡accelerate
Related to accelerate
🐛 bug
Something isn't working
#3487
opened May 23, 2025 by
wa008
5 tasks done
Completions Only Loss is incompatible with use_liger_kernel set as true
🐛 bug
Something isn't working
🏋 SFT
Related to SFT
#3484
opened May 22, 2025 by
arashpreetsinghmor
Vision Fine Tuning Gemma 3 takes Impossibly High VRAM (OOM Error 8xH200)
⚡accelerate
Related to accelerate
🐛 bug
Something isn't working
⚡ PEFT
Related to PEFT
#3481
opened May 22, 2025 by
amanmehra89
5 tasks done
【GRPO】Why are some batches of prompts not involved in training?
#3477
opened May 22, 2025 by
moguizhizi
5 tasks done
[GRPO] completion lengths are incorrectly logged when mask_truncated_completions=True
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#3476
opened May 22, 2025 by
edbeeching
5 tasks done
Is it possible to make prompts dynamic (or iterable datasets) in GRPO training?
✨ enhancement
New feature or request
🏋 GRPO
Related to GRPO
#3474
opened May 21, 2025 by
onlyjokers
[GPG][new trainer] Add support to new GPG method
✨ enhancement
New feature or request
#3472
opened May 20, 2025 by
lerogo
3 tasks done
[SFT] Can't apply_chat_template prompt-completion dataset. so, training result always bad.
🐛 bug
Something isn't working
🏋 SFT
Related to SFT
#3468
opened May 19, 2025 by
HERIUN
5 tasks done
trl vllm server generating stuck
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#3467
opened May 19, 2025 by
AdaChambers
5 tasks done
[GRPO] bnb quantization + vllm
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
⚡ PEFT
Related to PEFT
#3466
opened May 18, 2025 by
shon-otmazgin-wix
5 tasks done
PPO Training does not improve SFT model outputs (Metrics identical before and after PPO)
🏋 PPO
Related to PPO
🏋 SFT
Related to SFT
#3464
opened May 18, 2025 by
xmriz
Turn off Accelerate acceleration
⚡accelerate
Related to accelerate
🏋 GRPO
Related to GRPO
#3461
opened May 17, 2025 by
seTalent
Out of Memory when GRPO fine-tune Qwen3 4B model on 80G A100 GPU
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#3456
opened May 16, 2025 by
wa008
5 tasks done
PPO training fails when used with accelerate ⚡️ and Deepspeed 🚀
⚡accelerate
Related to accelerate
🚀 deepspeed
Related to deepspeed
🏋 PPO
Related to PPO
🏋 SFT
Related to SFT
#3453
opened May 16, 2025 by
marcellobullo
5 tasks done
GRPO reward=0 and loss=0
🏋 GRPO
Related to GRPO
🏋 Reward
Related to Reward modelling
#3452
opened May 15, 2025 by
LIUyizheSDU
torch distributed training with multi gpus errors in GRPOtrainer
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#3451
opened May 15, 2025 by
jinhonglu
5 tasks done
trl vllm-serve not working on latest.
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#3450
opened May 15, 2025 by
tcapelle
5 tasks done
Rewards functions for Command Line Interfaces GRPO trainer
📱 cli
Related to the Command-line interface
✨ enhancement
New feature or request
🏋 GRPO
Related to GRPO
🏋 Reward
Related to Reward modelling
#3448
opened May 15, 2025 by
wa008
VLLM Integration cannot run with data parallel alone
⚡accelerate
Related to accelerate
🐛 bug
Something isn't working
⚡ PEFT
Related to PEFT
#3446
opened May 14, 2025 by
MAOJIASONG
5 tasks done
[GRPO] num_generations
🏋 GRPO
Related to GRPO
❓ question
Seeking clarification or more information
#3443
opened May 13, 2025 by
shon-otmazgin-wix
5 tasks done
Unstructured data grpo training
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#3441
opened May 13, 2025 by
yuyuhua918
Logging docs appear to be out of date
📚 documentation
Improvements or additions to documentation
#3437
opened May 12, 2025 by
ezyang
5 tasks done
3