Commit 1422883 by nroggendorff
1 Parent(s): 130d74b
Update README.md
README.md
CHANGED
@@ -6,7 +6,7 @@ tags:
 - trl
 - sft
 - sgd
-base_model:
+base_model: TinyLlama/TinyLlama-1.1B-Chat-v1.0
 datasets:
 - nroggendorff/mayo
 model-index:
@@ -16,7 +16,7 @@ model-index:
 
 # Mayonnaise LLM
 
-Mayo is a language model fine-tuned on the [Mayo dataset](https://huggingface.co/datasets/nroggendorff/mayo) using Supervised Fine-Tuning (SFT) and Teacher Reinforced Learning (TRL) techniques. It is based on the [
+Mayo is a language model fine-tuned on the [Mayo dataset](https://huggingface.co/datasets/nroggendorff/mayo) using Supervised Fine-Tuning (SFT) and Teacher Reinforced Learning (TRL) techniques. It is based on the [TinyLlama Model](TinyLlama/TinyLlama-1.1B-Chat-v1.0)
 
 ## Features
 
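
The first hunk fills in a previously empty `base_model` field. As a rough way to check for that kind of gap, here is a minimal sketch that reads a scalar value from a README's metadata block. It assumes the block is delimited by `---` lines and uses plain `key: value` pairs. The helper `front_matter_value` and the abridged `before`/`after` strings are hypothetical illustrations, not part of this repository.

```python
def front_matter_value(readme_text, key):
    """Return the scalar value of `key` in the leading '---' block, or None.

    Assumption: the metadata block starts on the first line with '---',
    ends at the next '---', and the key sits at top level as 'key: value'.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return None  # no metadata block at the top of the file
    for line in lines[1:]:
        if line.strip() == "---":
            break  # end of the metadata block
        if line.startswith(key + ":"):
            value = line[len(key) + 1:].strip()
            return value or None  # treat an empty value as missing
    return None


# Hypothetical, abridged versions of the README before and after the commit.
before = (
    "---\ntags:\n- trl\n- sft\n- sgd\nbase_model:\n"
    "datasets:\n- nroggendorff/mayo\n---\n# Mayonnaise LLM\n"
)
after = before.replace(
    "base_model:", "base_model: TinyLlama/TinyLlama-1.1B-Chat-v1.0"
)

print(front_matter_value(before, "base_model"))  # None
print(front_matter_value(after, "base_model"))   # TinyLlama/TinyLlama-1.1B-Chat-v1.0
```

This hand-rolled parser only handles top-level scalar fields. A real YAML parser would be the safer choice for list-valued or nested metadata such as `tags` or `model-index`.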