Update README.md
Committed by cosimoiaia
Commit: 285c1c7
Parent: 163748b
README.md CHANGED
@@ -23,6 +23,9 @@ An exclusively Italian speaking, instruction finetuned, Large Language model.
 The Loquace Italian LLM model family was created as a proof-of-concept to evaluate how different model sizes can be fine-tuned using QLoRA on an instruct dataset
 of a specific language.
 
+The QLoRA (https://github.com/artidoro/qlora) fine-tuning method significantly lowers resource requirements compared with other available methods,
+which makes it easy to run the process on significantly larger datasets while still using consumer GPUs and achieving high accuracy.
+
 ## Model Description
 
 Loquace-12B is the first 12B Italian Large Language Model trained using QLoRA on a large dataset of 102k question/answer pairs
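The lines added in this commit say that QLoRA lowers resource requirements enough to fine-tune on consumer GPUs. A rough back-of-the-envelope sketch of why: QLoRA keeps the frozen base model in 4-bit precision, so the weights alone take about a quarter of the memory of a 16-bit copy. The helper name `weight_memory_gb` is hypothetical, and the estimate is an assumption-laden lower bound: it counts weights only and ignores activations, LoRA adapter weights, optimizer state, and quantization constants.

```python
# Rough weight-memory estimate for a 12B-parameter model.
# Assumption: weights only; activations, LoRA adapters, optimizer state,
# and quantization constants are NOT included.

def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate size of the weights in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9

NUM_PARAMS = 12e9  # 12B, taken from the model name Loquace-12B

fp16_gb = weight_memory_gb(NUM_PARAMS, 16)  # 16-bit baseline
nf4_gb = weight_memory_gb(NUM_PARAMS, 4)    # 4-bit quantized base model, as in QLoRA

print(f"16-bit weights: {fp16_gb:.1f} GB")
print(f"4-bit weights:  {nf4_gb:.1f} GB")
```

Under these assumptions the weights shrink from roughly 24 GB to roughly 6 GB, which is the difference between needing a data-center accelerator and fitting within the memory of a high-end consumer card. Actual training memory will be higher than this figure.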