Skip to content

[P1] The results on Qwen2.5-7B-Instruct #164

@mrsempress

Description

@mrsempress

Thank you very much for your work. May I ask if the author can provide the experimental results of LoReFT on qwen2.5-7B instruction. I used the original code and the command

python examples/loreft/train.py -task gsm8k -model ../models/Qwen2.5-7B-Instruct/ -seed 42 -l all -r 8 -p f7+l7 -e 12 -lr 9e-4 -type LoreftIntervention -gradient_accumulation_steps 8 -batch_size 4 -eval_batch_size 4 --dropout 0.05 --test_split test --use_normalized_template --share_weights --greedy_decoding --warmup_ratio 0 --weight_decay 0.06 -max_length 1024 -save_model -output_dir official_results/Qwen

The accuracy is 46.10, but I'm not sure if this value is normal. As the public accuracy of GSM8K(4-shot) on is 85.4.

Metadata

Metadata

Assignees

Labels

questionFurther information is requested

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions