--num_generations 4 \
--per_device_train_batch_size 8 \

With these settings it takes 48 hours. I will report back on whether this change hurts performance.