I hope grpo supports visual modeling #3232

chaodreaming · 2025-04-04T10:19:31Z

It looks like TRL is only one data-processing step away from supporting vision inputs.

Vision models can also be improved with GRPO, and other libraries have already implemented this for vision.

Aravind-LatentForce · 2025-04-20T23:39:52Z

Vision support (especially for Qwen 2.5 3B and 7B Instruct) would be extremely helpful. Is anyone working on this?

antichristHater · 2025-05-15T07:37:43Z

Anyone working on this?

It would be really helpful if they were!

Aravind-LatentForce · 2025-05-15T07:56:56Z

Anyone working on this?

It would be really helpful if they were!

I'm still facing issues with Qwen 2.5 VL training using GRPO. It'd be such a nice addition to have when it works.

github-actions bot added the 🏋 GRPO (Related to GRPO) and ✨ enhancement (New feature or request) labels on Apr 4, 2025

hellopahe mentioned this issue May 7, 2025

GRPO for vision models too? unslothai/unsloth#1662

Open

hellopahe linked a pull request May 16, 2025 that will close this issue

add support for image inputs in GRPO #3460

Open
