Model Description
This model is a fine-tuned version of unsloth/Meta-Llama-3.1-8B-bnb-4bit on cognitivecomputations/Code-290k-ShareGPT-Vicuna in order to answer questions related to programming better. Trained by the Google Colab Notebook provided by Unsloth with small modifications. Dataset format was converted from ShareGPT to Llama 3 in the training notebook. First 10k rows was used in training for demonstration purposes.
- Developed by: Can Deniz Koçak
- Finetuned from model: unsloth/Meta-Llama-3.1-8B-bnb-4bit
Fine-tuning Data
cognitivecomputations/Code-290k-ShareGPT-Vicuna
Training Procedure
Trained on a single A100 on Google Colab.
- Developed by: candenizkocak
- License: apache-2.0
- Finetuned from model : unsloth/Meta-Llama-3.1-8B-bnb-4bit
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 125
Model tree for candenizkocak/CoderLlama-3.1-8B-GGUF
Base model
meta-llama/Llama-3.1-8B
Quantized
unsloth/Meta-Llama-3.1-8B-bnb-4bit