Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Update wordle.py example with masking of env tokens
#4895 opened Jan 26, 2026 by sergiopaniego Loading…
5 tasks
Expose generation index to tool callables in GRPOTrainer
#4894 opened Jan 25, 2026 by lukehinds Loading…
4 tasks done
Upgrade GitHub Actions to latest versions
#4893 opened Jan 24, 2026 by salmanmkc Loading…
[GRPO] feat: Geometric Sequence Masking
#4891 opened Jan 24, 2026 by LeonEricsson Draft
5 tasks
Fix grpo tool calling
#4890 opened Jan 23, 2026 by akshayballal95 Loading…
2 tasks done
fix(vLLM): Add tool calling support to VLLMClient.chat()
#4889 opened Jan 23, 2026 by kansalaman Loading…
1 of 2 tasks
NeMo-Gym Integration
#4848 opened Jan 17, 2026 by cmunley1 Loading…
make dpo compatible with fsdp2
#4838 opened Jan 16, 2026 by flutist Loading…
4 of 5 tasks
feat: Support log_completion for swanlab backend
#4826 opened Jan 14, 2026 by ZiyiTsang Loading…
2 of 5 tasks
forward_masked_logits in SFTTrainer
#4794 opened Jan 8, 2026 by qgallouedec Draft
5 tasks
make dpo compatible with qwen3vl
#4773 opened Jan 4, 2026 by flutist Loading…
Extend CLI to orpo trainer
#4757 opened Dec 27, 2025 by murilo-cunha Loading…
3 of 5 tasks
fix: handle None eval_dataset in example code
#4756 opened Dec 27, 2025 by ciaoyizhen Loading…
1 of 4 tasks
perf: avoid output_hidden_states when only last_hidden_state is used
#4755 opened Dec 27, 2025 by ciaoyizhen Loading…
2 of 5 tasks
Clarify Accelerate usage in SFTTrainer documentation
#4744 opened Dec 23, 2025 by Likhita-17 Loading…
1 task done
fix minillm trainer
#4743 opened Dec 23, 2025 by t1101675 Loading…
5 tasks
ProTip! Filter pull requests by the default branch with base:main.