Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix VLM configs in generate_tiny_models
#4101 opened Sep 17, 2025 by albertvillanova Loading…
RewardTrainer refactor
#4093 opened Sep 15, 2025 by qgallouedec Loading…
5 tasks
feat:add support for 'image_grid_thw'(QwenVL) in DPOTrainer
#4091 opened Sep 15, 2025 by ycma8 Loading…
2 of 5 tasks
feat: Add WeaveCallback for W&B Weave integration
#4089 opened Sep 15, 2025 by parambharat Loading…
2 of 5 tasks
fix: use_liger_kernel with IterableDataset
#4087 opened Sep 15, 2025 by jue-jue-zi Loading…
2 of 5 tasks
Update links to docs in README to latest packaged version
#4084 opened Sep 15, 2025 by sergiopaniego Loading…
5 tasks
Fix usage of VLM using text only
#4080 opened Sep 14, 2025 by SamuelBarryCS Loading…
Add config_init_kwargs option in GRPOConfig
#4069 opened Sep 12, 2025 by hokuyama0106 Loading…
2 of 5 tasks
Add VLM support to RLOO trainer
#4067 opened Sep 11, 2025 by behroozazarkhalili Loading…
feat: Add NPU and XPU support for activation offloading
#4056 opened Sep 10, 2025 by zilongzheng Loading…
2 of 5 tasks
Enable XPU for vllm client
#4031 opened Sep 8, 2025 by jiqing-feng Loading…
vllm sleep mode support
#4028 opened Sep 8, 2025 by ved1beta Loading…
2 of 5 tasks
Fix: undefined current_gradient_accumulation_steps
#4014 opened Sep 5, 2025 by ysjprojects Loading…
2 of 5 tasks
Improve typing of SFT trainer
#4007 opened Sep 4, 2025 by cyyever Loading…
ProTip! Adding no:label will show everything without a label.