Edit model card

language: en

rawpowertools/MH_2500T_L_Qwen2_500M_gguf Model Data

Base_Model: unsloth/Qwen2-0.5B

Training_Data: mh_2500_train

Eval_Input: mh_small_test

Merged_Model: rawpowertools/MH_2500T_L_Qwen2_500M

Epochs: 5

Rank: 32

Alpha: 32

LR: 0.0005

LR_Scheduler: linear

ClearML: http://clearml.rptinternal.com:8080/projects/d061c7fcfaa049b69a4ee1ff0ed89be2/experiments/38beaff07fb241ba992901832d3fcfbe/output/log

Downloads last month
51
GGUF
Model size
494M params
Architecture
qwen2

4-bit

8-bit

16-bit

Inference API
Unable to determine this model's library. Check the docs .