# tassadar667/ChatGLM6B-Legal: ptuning/ds_train_finetune.sh


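# Fine-tune THUDM/chatglm-6b with DeepSpeed on 4 GPUs.

# Learning rate; also used as the suffix of the output directory name.
LR=1e-4

# Pick a random port for the launcher to reduce the chance of clashing with another job on the same host.
MASTER_PORT=$(shuf -n 1 -i 10000-65535)

deepspeed --num_gpus=4 --master_port "$MASTER_PORT" main.py \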
	--deepspeed deepspeed.json \
	--do_train \
	--train_file AdvertiseGen/train.json \
	--test_file AdvertiseGen/dev.json \
	--prompt_column content \
	--response_column summary \
	--overwrite_cache \
	--model_name_or_path THUDM/chatglm-6b \
	--output_dir ./output/adgen-chatglm-6b-ft-$LR \
	--overwrite_output_dir \
	--max_source_length 64 \
	--max_target_length 64 \
	--per_device_train_batch_size 4 \
	--per_device_eval_batch_size 1 \
	--gradient_accumulation_steps 1 \
	--predict_with_generate \
	--max_steps 5000 \
	--logging_steps 10 \
	--save_steps 1000 \
	--learning_rate $LR \
	--fp16
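
The batch-related flags above interact, so it can help to work out what the run actually consumes. The following is a minimal sketch, assuming plain data parallelism across the 4 GPUs and no checkpoint limit; the variable names are hypothetical and only mirror the flag values in the command.

```shell
#!/usr/bin/env bash
# Values copied by hand from the launch command (names are illustrative only).
NUM_GPUS=4            # --num_gpus=4
PER_DEVICE_BATCH=4    # --per_device_train_batch_size 4
GRAD_ACCUM=1          # --gradient_accumulation_steps 1
MAX_STEPS=5000        # --max_steps 5000
SAVE_STEPS=1000       # --save_steps 1000

# Sequences consumed per optimizer step across all GPUs.
EFFECTIVE_BATCH=$((NUM_GPUS * PER_DEVICE_BATCH * GRAD_ACCUM))

# Sequences seen over the whole run (with repeats if the dataset is smaller).
TOTAL_SEQUENCES=$((EFFECTIVE_BATCH * MAX_STEPS))

# Checkpoints written, assuming one every SAVE_STEPS steps and none deleted.
NUM_CHECKPOINTS=$((MAX_STEPS / SAVE_STEPS))

echo "effective batch size: $EFFECTIVE_BATCH"
echo "sequences seen:       $TOTAL_SEQUENCES"
echo "checkpoints written:  $NUM_CHECKPOINTS"
```

With the values in this script the effective batch size is 16, so halving the GPU count while doubling `--gradient_accumulation_steps` would keep it unchanged.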