Custom DataCollator Bug in RewardTrainer #3101

@pasztorb

Description

Reproduction

If you use a custom data_collator with the RewardTrainer class defined in trl/trainer/reward_trainer.py, the variable max_length is never assigned at Line 177, which causes an undefined-variable error at Line 220 (and at Line 236 if an eval_dataset is used).
Is there a specific reason why the max_length variable is defined only inside the if data_collator is None branch? If not, I would suggest moving it outside of that statement. Currently, I have to subclass RewardTrainer and override the __init__ method if I want to use a custom data collator.
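A minimal sketch of the bug pattern, independent of trl (the function name and defaults below are illustrative, not the actual RewardTrainer code): a variable assigned only inside the `data_collator is None` branch is read unconditionally later, so passing a custom collator raises `UnboundLocalError`.

```python
def init_trainer(data_collator=None, args_max_length=None):
    """Toy stand-in for the __init__ logic; not the real trl implementation."""
    if data_collator is None:
        # max_length is assigned only in this branch (mirrors Line 177)
        max_length = args_max_length or 512
        data_collator = object()  # stand-in for the default collator
    # ...later, max_length is used unconditionally (mirrors Lines 220/236):
    return max_length  # UnboundLocalError when a custom collator was passed


init_trainer()  # default collator: works, max_length falls back to 512
try:
    init_trainer(data_collator=lambda batch: batch)  # custom collator
except UnboundLocalError as err:
    print("reproduced:", err)
```

Moving the `max_length = ...` assignment above the `if` would make both paths work, which is the change suggested above.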
Thanks!

System Info

  • Platform: Linux-5.15.0-131-generic-x86_64-with-glibc2.35
  • Python version: 3.11.6
  • PyTorch version: 2.3.1
  • CUDA device(s): NVIDIA TITAN RTX
  • Transformers version: 4.48.3
  • Accelerate version: 1.3.0
  • Accelerate config: not found
  • Datasets version: 3.2.0
  • HF Hub version: 0.28.1
  • TRL version: 0.14.0
  • bitsandbytes version: 0.45.1
  • DeepSpeed version: not installed
  • Diffusers version: not installed
  • Liger-Kernel version: not installed
  • LLM-Blender version: not installed
  • OpenAI version: not installed
  • PEFT version: 0.14.0

Checklist

  • I have checked that my issue isn't already filed (see open issues)
  • I have included my system information
  • Any code provided is minimal, complete, and reproducible (more on MREs)
  • Any code provided is properly formatted in code blocks (no screenshots, more on code blocks)
  • Any traceback provided is complete

Metadata

Labels

  • 🏋 Reward (Related to Reward modelling)
  • 🐛 bug (Something isn't working)