Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Remove tokenizer creation from sft example script
#4197 opened Oct 2, 2025 by sergiopaniego Loading…
5 tasks
Add trainers taxonomy to docs
#4195 opened Oct 2, 2025 by sergiopaniego Loading…
5 tasks
Replace setup with pyproject
#4194 opened Oct 2, 2025 by albertvillanova Loading…
[Online-DPO] fix the completion_len == max_new_tokens crash 🐛 bug Something isn't working
#4193 opened Oct 2, 2025 by kashif Loading…
Account for additional processor outputs
#4191 opened Oct 1, 2025 by KarelKenens Loading…
Replace unittest with pytest
#4188 opened Oct 1, 2025 by albertvillanova Loading…
Add trust_remote_code to GRPOConfig
#4186 opened Oct 1, 2025 by muupan Loading…
3 of 4 tasks
🐍 Python is dead, long live Python
#4183 opened Sep 30, 2025 by qgallouedec Loading…
[DOCS] Lora without regret
#4181 opened Sep 30, 2025 by burtenshaw Loading…
Updated vLLM integration guide
#4162 opened Sep 29, 2025 by sergiopaniego Loading…
5 tasks
[WIP] Tool call
#4151 opened Sep 26, 2025 by qgallouedec Draft
5 tasks
[TODO] Fix GKD Liger memory spike
#4140 opened Sep 24, 2025 by qgallouedec Loading…
[WIP] Make the CI faster
#4127 opened Sep 23, 2025 by qgallouedec Draft
feat:add support for 'image_grid_thw'(QwenVL) in DPOTrainer
#4091 opened Sep 15, 2025 by ycma8 Loading…
2 of 5 tasks
Update links to docs in README to latest packaged version
#4084 opened Sep 15, 2025 by sergiopaniego Loading…
5 tasks
Add config_init_kwargs option in GRPOConfig
#4069 opened Sep 12, 2025 by hokuyama0106 Loading…
2 of 5 tasks
ProTip! no:milestone will show everything without a milestone.