ayoubkirouane
commited on
Commit
•
ced5145
1
Parent(s):
c5f99ec
Update README.md
Browse files
README.md
CHANGED
@@ -11,3 +11,26 @@ datasets:
|
|
11 |
- ayoubkirouane/arxiv-physics
|
12 |
pipeline_tag: text-generation
|
13 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
11 |
- ayoubkirouane/arxiv-physics
|
12 |
pipeline_tag: text-generation
|
13 |
---
|
14 |
+
|
15 |
+
|
16 |
+
## Phi-2-arxiv-physics :
|
17 |
+
|
18 |
+
Phi-2-arxiv-physics is a small language model (SLM) derived from Microsoft Research's Phi-2, a Transformer-based model with 2.7 billion parameters.
|
19 |
+
Specifically fine-tuned using the innovative QLoRA (Quantized Low-rank Adapters) technique, this model exhibits remarkable capabilities in the domain of physics-related text.
|
20 |
+
|
21 |
+
|
22 |
+
|
23 |
+
|
24 |
+
## Fine-Tuning:
|
25 |
+
|
26 |
+
The model has been fine-tuned using **QLoRA**, a technique that reduces computational and memory requirements during the fine-tuning process.
|
27 |
+
|
28 |
+
This two-stage approach involves quantizing parameters and introducing learnable low-rank adapters, resulting in faster fine-tuning with reduced memory demands.
|
29 |
+
|
30 |
+
## Data Source:
|
31 |
+
|
32 |
+
**Phi-2-arxiv-physics** is specifically trained on the **arXiv-physics** dataset, consisting of questions and answers related to physics.
|
33 |
+
|
34 |
+
This dataset is sourced from the arXiv repository, encompassing a wide range of topics within the field of physics.
|
35 |
+
|
36 |
+
|