-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Pull requests: huggingface/open-r1
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
🦜Enhance repetition penalty reward for language that cannot be split by whitespace
#516
opened Mar 18, 2025 by
binary-husky
Loading…
add ddp_timeout argument in SFTConfig to avoid timeout error(issue #160)
#513
opened Mar 17, 2025 by
HowToNameMe
Loading…
Extend max_model_length to prevent context truncation
#463
opened Mar 3, 2025 by
eldarkurtic
Loading…
feat: make reward functions to support parallel computation
#398
opened Feb 23, 2025 by
0x404
Loading…
New GRPO dataset and tasks: formally-verified program correctness
#379
opened Feb 20, 2025 by
ocramz
Loading…
Fix: Default value of
cosine_min_value_wrong
parameter
#305
opened Feb 13, 2025 by
zhangsheng377
Loading…
Simplified installation requirements to support more accelerators
#303
opened Feb 13, 2025 by
ji-huazhong
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-03-20.