-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Pull requests: huggingface/open-r1
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
When the chat_template is not set in the YAML configuration file, crashes
#621
opened Apr 24, 2025 by
dignfei
Loading…
Extend max_model_length to prevent context truncation
#463
opened Mar 3, 2025 by
eldarkurtic
Loading…
feat: make reward functions to support parallel computation
#398
opened Feb 23, 2025 by
0x404
Loading…
Fix: Default value of
cosine_min_value_wrong
parameter
#305
opened Feb 13, 2025 by
zhangsheng377
Loading…
[GRPO] generate with prompt containing the first <think> tag
#283
opened Feb 11, 2025 by
kashif
Loading…
Fix: Avoid empty keyword argument in VLLMModelConfig from Makefile
#246
opened Feb 8, 2025 by
mattdepaolis
Loading…
Replace the base model deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B to Qwen/Qwen2.5-1.5B-Instruct in GRPO
#198
opened Feb 5, 2025 by
DVampire
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.