Asking for params

#1
by Manel-Hik - opened

Hi,
Thanks for this great effort.
Could you share the parameters used in training?
Thanks in advance.

Hi, this is a LoRA model; it must be merged with the base model before it can be used.
If you need a model with all parameters already merged in 16-bit precision, please check here: https://huggingface.co/Omartificial-Intelligence-Space/Arabic-llama3.1-16bit-FT
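For reference, merging can be done with the PEFT library; a minimal sketch is below. The base-model ID and adapter path are placeholders I've assumed, not the exact repos:

```python
# Minimal sketch of merging a LoRA adapter into its base model with PEFT.
# The base-model ID and adapter path are placeholders, not the exact repos.
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3.1-8B",   # assumed base model
    torch_dtype="auto",
)
model = PeftModel.from_pretrained(base, "path/to/lora-adapter")

# Fold the adapter weights into the base weights so the result can be
# saved and used as a standalone checkpoint.
merged = model.merge_and_unload()
merged.save_pretrained("merged-model")
```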

Hope this answers your query.

Hi,
Thanks for sharing.
But I meant the fine-tuning parameters, i.e. learning rate, warmup, batch size, quantization (4-bit or 8-bit)...
Thanks in advance.

The base model was loaded in 4-bit precision and then fine-tuned with the following params:

lora_alpha = 16,
lora_dropout = 0,
bias = "none",
learning_rate = 2e-4,
per_device_train_batch_size = 2,
gradient_accumulation_steps = 4,
warmup_steps = 5,
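
Roughly, in a standard transformers + peft + trl setup those values would map to something like the sketch below. The base-model ID, dataset, target modules, and trainer are assumptions on my part; only the hyperparameters above (plus the rank "r": 16 from the adapter config, mentioned further down in this thread) come from the discussion.

```python
# Minimal QLoRA sketch using the parameters quoted above.
# Base-model ID, dataset, target modules, and trainer choice are assumptions.
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig
from trl import SFTConfig, SFTTrainer

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # base model loaded in 4 bits
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3.1-8B",         # assumed base model
    quantization_config=bnb_config,
)

peft_config = LoraConfig(
    r=16,                                   # rank, per the adapter config ("r": 16)
    lora_alpha=16,
    lora_dropout=0,
    bias="none",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed
    task_type="CAUSAL_LM",
)

args = SFTConfig(
    learning_rate=2e-4,
    per_device_train_batch_size=2,
    gradient_accumulation_steps=4,          # effective batch size of 8
    warmup_steps=5,
    output_dir="outputs",
)

trainer = SFTTrainer(
    model=model,
    args=args,
    train_dataset=load_dataset("your/dataset", split="train"),  # placeholder
    peft_config=peft_config,
)
trainer.train()
```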

What was the rank for this model?

Never mind, I saw it in the config file: "r": 16.

Omartificial-Intelligence-Space changed discussion status to closed
