dhanushreddy29
commited on
Commit
•
c03dfcd
1
Parent(s):
0a2ed43
Update README.md
Browse files
README.md
CHANGED
@@ -10,4 +10,5 @@ base_model:
|
|
10 |
# Model Card for Model ID
|
11 |
|
12 |
<!-- Provide a quick summary of what the model is/does. -->
|
13 |
-
Just testing out LLM Finetuning
|
|
|
|
10 |
# Model Card for Model ID
|
11 |
|
12 |
<!-- Provide a quick summary of what the model is/does. -->
|
13 |
+
Just testing out LLM Finetuning. Finetuned on [upstage/SOLAR-10.7B-Instruct-v1.0](https://huggingface.co/upstage/SOLAR-10.7B-Instruct-v1.0) using [argilla/distilabel-intel-orca-dpo-pairs](https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs).
|
14 |
+
Followed the Google Colab mentioned in this article: [https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac](https://towardsdatascience.com/fine-tune-a-mistral-7b-model-with-direct-preference-optimization-708042745aac)
|