Edit model card

Model Description

This model is a fine-tuned version of unsloth/Meta-Llama-3.1-8B-bnb-4bit on cognitivecomputations/Code-290k-ShareGPT-Vicuna in order to answer questions related to programming better. Trained by the Google Colab Notebook provided by Unsloth with small modifications. Dataset format was converted from ShareGPT to Llama 3 in the training notebook. First 10k rows was used in training for demonstration purposes.

Fine-tuning Data

cognitivecomputations/Code-290k-ShareGPT-Vicuna

Training Procedure

Trained on a single A100 on Google Colab. Open In Colab

  • Developed by: candenizkocak
  • License: apache-2.0
  • Finetuned from model : unsloth/Meta-Llama-3.1-8B-bnb-4bit

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
125
GGUF
Model size
8.03B params
Architecture
llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for candenizkocak/CoderLlama-3.1-8B-GGUF

Quantized
(194)
this model