
experiment_2_8b-fp16

Another experimental training run with unsloth. This time it covers roughly 0.6 epochs of the cleaned c2-logs. My hyperparameters are probably bad, since the loss value looked very strange toward the end. I also uploaded another version to the checkpoint-3500 branch that may mitigate some of that.
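For anyone who wants to compare the two uploads, here is a minimal sketch of loading either the default branch or the alternate checkpoint with the `transformers` library. It assumes the alternate branch is literally named `checkpoint-3500` (check the branch list on the Hub before relying on it) and that `torch` and `transformers` are installed. The helper names `load_kwargs` and `load_model` are made up for this example.

```python
REPO_ID = "rAIfle/experiment_2_8b-fp16"  # repo id as shown on the model page
ALT_BRANCH = "checkpoint-3500"  # ASSUMPTION: exact branch name, verify on the Hub


def load_kwargs(revision=None):
    """Build extra keyword arguments for from_pretrained.

    revision=None means the default branch of the repo.
    """
    kwargs = {}
    if revision is not None:
        kwargs["revision"] = revision
    return kwargs


def load_model(revision=None):
    """Download and load the tokenizer and the FP16 weights (roughly 16 GB)."""
    # Imported lazily so load_kwargs works without these packages installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    kwargs = load_kwargs(revision)
    tokenizer = AutoTokenizer.from_pretrained(REPO_ID, **kwargs)
    model = AutoModelForCausalLM.from_pretrained(
        REPO_ID, torch_dtype=torch.float16, **kwargs
    )
    return tokenizer, model
```

Calling `load_model()` fetches the default branch, while `load_model(ALT_BRANCH)` fetches the alternate checkpoint, so the two can be tried side by side on the same prompts.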

Format: Safetensors
Model size: 8.03B params
Tensor type: FP16

Collection including rAIfle/experiment_2_8b-fp16