Feature request
As far as I can tell, TRL is just one data-processing step away from supporting vision models.
Motivation
Vision models can also be improved with GRPO, and other libraries have already implemented it for visual models.
Your contribution
[no](https://github.com/Deep-Agent/R1-V)
Vision support (especially for Qwen 2.5 3B and 7B Instruct) would be extremely helpful. Is anyone working on this?
Anyone working on this?
It would be really helpful if they were!
I'm still facing issues with Qwen 2.5 VL training using GRPO. It'd be such a nice addition to have when it works.