cosimoiaia
/

Loquace-12B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

cosimoiaia commited on Jun 10, 2023

Commit

285c1c7

•

1 Parent(s): 163748b

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -23,6 +23,9 @@ An exclusively Italian speaking, instruction finetuned, Large Language model.
 The Loquace Italian LLM models family was created as a proof-of-concept to evaluate on how different model sizes can be fine-tuned using QLoRa on an instruct dataset
 of a specific language.
 ## Model Description
 Loquace-12B is the first 12B italian Large Language Model trained using QLoRa on a large dataset of 102k question/answer pairs

 The Loquace Italian LLM models family was created as a proof-of-concept to evaluate on how different model sizes can be fine-tuned using QLoRa on an instruct dataset
 of a specific language.
+The QLoRa (https://github.com/artidoro/qlora) method of fine-tuning significantly lower the resources requirements compared to any other methods available,
+this allow to easily execute the process on significanly larger dataset while still using consumers GPUs and still achieve high accuracy.
 ## Model Description
 Loquace-12B is the first 12B italian Large Language Model trained using QLoRa on a large dataset of 102k question/answer pairs