Pull requests: huggingface/trl

Update TRL banner to support light/dark mode
#5270 opened Mar 10, 2026 by qgallouedec
Centralize AI agent templates in .ai
#5268 opened Mar 10, 2026 by qgallouedec
async streaming grpo w prefetch
#5250 opened Mar 9, 2026 by winglian
5 tasks
Fix error message in OnlineDPO
#5237 opened Mar 7, 2026 by qgallouedec
fix logprobs handling 🩹 for patch
#5198 opened Feb 27, 2026 by winglian
5 tasks
Misc packing improvements
#5189 opened Feb 26, 2026 by mariosasko
1 of 5 tasks
Simplify NeMo Gym user experience
#5156 opened Feb 24, 2026 by cmunley1
DPO padding-free
#5141 opened Feb 21, 2026 by qgallouedec Draft
5 tasks
[GKD] Buffer Implementation for Distillation Trainer
#5137 opened Feb 20, 2026 by cmpatino
3 tasks done
MGPO feature addition
#5126 opened Feb 19, 2026 by damoonsh
2 of 5 tasks
feat(experimental): Divergence Proximal Policy Optimization
#5117 opened Feb 17, 2026 by LeonEricsson
5 tasks