This is my config_demo.yaml:

model_name_or_path: model/Qwen2.5-1.5B-Instruct
model_revision: main
torch_dtype: bfloat16
attn_implementation: flash_attention_2

# Data training arguments
dataset_name: open-r1/OpenR1-Math-220k
dataset_configs:
dataset_num_proc: 48

# SFT trainer config
bf16: true
do_eval: false
eval_strategy: 'no'
gradient_accumulation_steps: 1
gradient_checkpointing: true
gradient_checkpointing_kwargs:
  use_reentrant: false
hub_model_id: Qwen2.5-1.5B-Open-R1-Distill
hub_strategy: every_save
learning_rate: 5.0e-05
log_level: info
logging_steps: 5
logging_strategy: steps
lr_scheduler_type: cosine_with_min_lr
lr_scheduler_kwargs:
  min_lr_rate: 0.1
packing: true
max_seq_length: 16384
max_steps: -1
num_train_epochs: 1
output_dir: /model/Qwen2.5-1.5B-Open-R1-math220k-Distill-useliger_8gpu
overwrite_output_dir: true
per_device_eval_batch_size: 16
per_device_train_batch_size: 16
push_to_hub: false
report_to:
save_strategy: "steps"
save_steps: 100
save_total_limit: 1
seed: 42
use_liger: true
warmup_ratio: 0.05

And this is my ddp.yaml (accelerate config):

compute_environment: LOCAL_MACHINE
debug: false
distributed_type: MULTI_GPU
downcast_bf16: 'no'
gpu_ids: "0,1,2,3,4,5,6,7"
machine_rank: 0
main_training_function: main
mixed_precision: bf16
num_machines: 1
num_processes: 8
rdzv_backend: static
same_network: true
tpu_env: []
tpu_use_cluster: false
tpu_use_sudo: false
use_cpu: false
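The two files are combined with `accelerate launch`. A minimal sketch of that launch command, assuming the open-r1 repo layout (the `src/open_r1/sft.py` entry point) and that both YAML files sit in the working directory:

```shell
# Sketch only: the script path assumes the open-r1 repo layout and that both
# YAML files are in the working directory; adjust paths to match your checkout.
ACCELERATE_LOG_LEVEL=info accelerate launch \
    --config_file ddp.yaml \
    src/open_r1/sft.py \
    --config config_demo.yaml
```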
The training loss decreases to about 0.61, but the result on MATH-500 is lower than the original model's.
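For context, open-r1 scores MATH-500 with lighteval's vLLM backend and the repo's custom tasks. A rough sketch of that kind of evaluation command (the model args, task spec, and flags here are assumptions based on the repo's README and may differ between versions, so check the README for the exact invocation):

```shell
# Sketch only: model args and task spec are assumptions; consult the open-r1
# README for the current evaluation command and parameters.
MODEL=/model/Qwen2.5-1.5B-Open-R1-math220k-Distill-useliger_8gpu
MODEL_ARGS="pretrained=$MODEL,dtype=bfloat16,max_model_length=32768,gpu_memory_utilization=0.8"
lighteval vllm "$MODEL_ARGS" "custom|math_500|0|0" \
    --custom-tasks src/open_r1/evaluate.py \
    --use-chat-template \
    --output-dir data/evals/$MODEL
```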
Hi @HwangYej1, I am running into a similar issue. How did you fix it?
I don't know why, but I still haven't managed to reproduce it either; the test results after SFT are very poor.