Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix clip ratio logging
#3506 opened May 28, 2025 by qgallouedec Loading…
5 tasks
Rearrange DPOTrainer
#3501 opened May 27, 2025 by DaizeDong Loading…
2 of 5 tasks
HF Doc Builder style
#3498 opened May 26, 2025 by qgallouedec Draft
[GRPO] Adds SSR priorized replay buffer
#3496 opened May 26, 2025 by edbeeching Loading…
[GKD] Use vllm for the teacher model
#3475 opened May 21, 2025 by kashif Draft
5 tasks
Add support for CB with native transformers
#3471 opened May 20, 2025 by ArthurZucker Loading…
Allow an user to train from a local dataset
#3470 opened May 19, 2025 by gogo2464 Loading…
1 of 5 tasks
add support for image inputs in GRPO
#3460 opened May 16, 2025 by hellopahe Loading…
[SFT] add warning if dataset's input_ids exceed max_length
#3449 opened May 15, 2025 by HERIUN Loading…
1 of 5 tasks
Fix logging docs
#3447 opened May 14, 2025 by xingyaoww Draft
2 of 5 tasks
🛠️ quantization support for vllm generation
#3428 opened May 8, 2025 by shirinyamani Loading…
5 tasks
Reintroducing step method in ppo_trainer
#3410 opened May 3, 2025 by jskaf34 Loading…
2 of 5 tasks
fix setup chat format
#3404 opened May 2, 2025 by qgallouedec Draft
5 tasks
Reintroduce generate method for PPOTrainer
#3374 opened Apr 27, 2025 by CloseChoice Loading…
4 tasks done
add support for reward func using nn.Module in GRPOTrainer
#3372 opened Apr 27, 2025 by Tavish9 Loading…
1 of 5 tasks
[Feat] Suppport SGLang as rollout engine of GRPO trainer
#3370 opened Apr 27, 2025 by ryang-max Loading…
2 of 8 tasks
Environments
#3367 opened Apr 26, 2025 by August-murr Draft
add vllm support for token ids as input
#3280 opened Apr 11, 2025 by wybryan Loading…
🦙 Llama 4
#3267 opened Apr 9, 2025 by qgallouedec Draft
5 tasks
ProTip! Filter pull requests by the default branch with base:main.