Issues · huggingface/trl

[Tracking issue] Integrate native liger-kernel losses

#2495 opened Dec 17, 2024 by qgallouedec

Open 6

[Tracking issue] Wrong loss scaling when accumulating gradient

#2617 opened Jan 23, 2025 by qgallouedec

Open

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

398 Open 1,408 Closed

⚡accelerate 🐛 bug

#3487 opened May 23, 2025 by wa008

5 tasks done

⚡accelerate 🐛 bug ⚡ PEFT

#3481 opened May 22, 2025 by amanmehra89

5 tasks done

[GPG][new trainer] Add support to new GPG method ✨ enhancement

#3472 opened May 20, 2025 by lerogo

3 tasks done

🐛 bug 🏋 GRPO

#3467 opened May 19, 2025 by AdaChambers

5 tasks done

🐛 bug 🏋 GRPO ⚡ PEFT

#3466 opened May 18, 2025 by shon-otmazgin-wix

5 tasks done

⚡accelerate 🏋 GRPO

#3461 opened May 17, 2025 by seTalent

🐛 bug 🏋 GRPO

#3456 opened May 16, 2025 by wa008

5 tasks done

🐛 bug 📱 cli 🏋 GRPO 🏋 Reward

#3455 opened May 16, 2025 by wa008

5 tasks done

⚡accelerate 🚀 deepspeed 🏋 PPO 🏋 SFT

#3453 opened May 16, 2025 by marcellobullo

5 tasks done

🏋 GRPO 🏋 Reward

#3452 opened May 15, 2025 by LIUyizheSDU

🐛 bug 🏋 GRPO

#3451 opened May 15, 2025 by jinhonglu

5 tasks done

🐛 bug 🏋 GRPO

#3450 opened May 15, 2025 by tcapelle

5 tasks done

📱 cli ✨ enhancement 🏋 GRPO 🏋 Reward

#3448 opened May 15, 2025 by wa008

⚡accelerate 🐛 bug ⚡ PEFT

#3446 opened May 14, 2025 by MAOJIASONG

5 tasks done

🏋 GRPO ❓ question

#3443 opened May 13, 2025 by shon-otmazgin-wix

5 tasks done

🐛 bug 🏋 GRPO

#3441 opened May 13, 2025 by yuyuhua918

📚 documentation

#3437 opened May 12, 2025 by ezyang

5 tasks done

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Issues: huggingface/trl

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Issues list