Pull requests: huggingface/trl

Pull requests list

Fix unset tokenizer pad_token
#3290 opened Apr 14, 2025 by LeonEricsson
4 of 5 tasks
[GRPO] Add metrics for low and high clipped token probabilities
#3289 opened Apr 14, 2025 by lewtun
5 tasks
Modified GRPOTrainer to accumulate gradient within a single training batch
#3288 opened Apr 13, 2025 by jarrelscy
3 of 5 tasks
Add Ascend NPU support for vLLM server
#3286 opened Apr 12, 2025 by ji-huazhong
5 tasks
☝️ [GRPO] Generate once per effective batch
#3283 opened Apr 12, 2025 by qgallouedec
5 tasks
Fixes typo in SFTTrainer
#3282 opened Apr 11, 2025 by taras-sereda
1 of 5 tasks
vllm-dp-v1
#3281 opened Apr 11, 2025 by shirinyamani Draft
5 tasks
add vllm support for token ids as input
#3280 opened Apr 11, 2025 by wybryan
Reward takes completion ids
#3272 opened Apr 9, 2025 by qgallouedec Draft
5 tasks
🦙 Llama 4
#3267 opened Apr 9, 2025 by qgallouedec Draft
5 tasks
[SFT] support for ring_attn in SFTTrainer
#3262 opened Apr 8, 2025 by kashif
5 tasks
[🐯+GRPO] Support FSDP + Fix bug when using LigerGRPO with DDP
#3260 opened Apr 8, 2025 by shivam15s
1 of 5 tasks
Add a raw generate API to the vLLM server
#3227 opened Apr 3, 2025 by wilrop
5 tasks
Support iterable datasets in GRPO
#3226 opened Apr 3, 2025 by wilrop
5 tasks
Adding sampling parameters for vllm generation
#3210 opened Apr 2, 2025 by shaipranesh2
GRPO: Scalable training with one LLM/node
#3186 opened Mar 31, 2025 by jglaser
3 of 5 tasks
Extend BCO Trainer dataset format support
#3134 opened Mar 22, 2025 by reihig-ut
1 of 5 tasks