Materaili committed on
Commit
bd8fe57
1 Parent(s): 388551c

Create readme.md

Files changed (1): readme.md +13 -0
readme.md ADDED
# Base model: Llama 3.1 8B

## Modifications:
1. Quantization to INT4 so the model could be trained on a Colab A100 GPU with 40 GB of VRAM.
2. LoRA for parameter-efficient fine-tuning, which made it possible to attach an adapter customized for the specific task (a minimal sketch of this setup follows below).

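The exact training code is not part of this repo; the sketch below shows one way the two modifications above could be wired together with Hugging Face `transformers`, `bitsandbytes`, and `peft`. The model id, LoRA hyperparameters, and target modules are illustrative assumptions, not the settings used for this checkpoint.

```python
# Illustrative sketch only: INT4 (4-bit) loading plus a LoRA adapter.
# Model id, target modules, and LoRA hyperparameters are assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

base_id = "meta-llama/Meta-Llama-3.1-8B"  # assumed base checkpoint

# 4-bit quantization so the 8B model fits comfortably in 40 GB of VRAM
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(
    base_id,
    quantization_config=bnb_config,
    device_map="auto",
)

# LoRA adapter: only the small adapter matrices are trained,
# while the quantized base weights stay frozen.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # adapter params are a tiny fraction of 8B
```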
## Observations:
1. The initial model does not have enough predictive power to distinguish between the individual entries passed to it during inference.
2. The adapter does adapt the model to the specific task: this was evident when the model shifted its predictions toward the majority class instead of predicting essentially at random during inference (a small check along these lines is sketched after this list).
3. The requirement is modest: adapt the model and the data passed to it so that it gains some predictive power.

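The majority-class behaviour in observation 2 can be made visible by counting the labels the model emits over a held-out set. The sketch below assumes the task is framed as generating a short class label per prompt and reuses `model` and `tokenizer` from the previous sketch; `predict_label` and `validation_prompts` are hypothetical.

```python
# Hypothetical check: does the fine-tuned model collapse onto the majority class?
from collections import Counter

def predict_label(model, tokenizer, prompt: str, max_new_tokens: int = 5) -> str:
    """Generate a short greedy completion and treat it as the predicted label."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    generated = output[0][inputs["input_ids"].shape[-1]:]
    return tokenizer.decode(generated, skip_special_tokens=True).strip()

# validation_prompts stands in for the held-out inference inputs
validation_prompts = ["<prompt 1>", "<prompt 2>", "<prompt 3>"]
counts = Counter(predict_label(model, tokenizer, p) for p in validation_prompts)
print(counts)  # one dominant label suggests majority-class collapse
```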
## Actions:
- Use a 70B model
-