[P1] The results on Qwen2.5-7B-Instruct

Thank you very much for your work. May I ask if the author can provide the experimental results of LoReFT on qwen2.5-7B instruction. I used the original code and the command
```
python examples/loreft/train.py -task gsm8k -model ../models/Qwen2.5-7B-Instruct/ -seed 42 -l all -r 8 -p f7+l7 -e 12 -lr 9e-4 -type LoreftIntervention -gradient_accumulation_steps 8 -batch_size 4 -eval_batch_size 4 --dropout 0.05 --test_split test --use_normalized_template --share_weights --greedy_decoding --warmup_ratio 0 --weight_decay 0.06 -max_length 1024 -save_model -output_dir official_results/Qwen
```
The accuracy is **`46.10`**, but I'm not sure if this value is normal. As [the public accuracy of GSM8K](https://qwenlm.github.io/zh/blog/qwen2.5-llm/)(**4-shot**) on is **85.4**.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[P1] The results on Qwen2.5-7B-Instruct #164

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[P1] The results on Qwen2.5-7B-Instruct #164

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions