1 parent 529444a commit 0d9911f
scripts/train/tulu3/finetune_8b.sh
@@ -37,7 +37,6 @@ uv run python mason.py \
     --warmup_ratio 0.03 \
     --weight_decay 0.0 \
     --num_train_epochs 2 \
-    --reduce_loss sum \
     --use_flash_attn \
     --gradient_checkpointing \
     --report_to wandb \