Update README.md
Browse files
README.md
CHANGED
@@ -14,7 +14,10 @@ Use this frankenbase for training.
|
|
14 |
Sorry for the mislabelling, the model is a 0.18b 181m parameter, not 0.15.
|
15 |
I did not except this repo to blow up and now all the training scripts depend on it.
|
16 |
|
17 |
-
* ##
|
|
|
|
|
|
|
18 |
|
19 |
>>[!TIP]🐧 If you're imatient, get the trained checkpoint file that runs on 1 cpu core:
|
20 |
>>
|
|
|
14 |
Sorry for the mislabelling, the model is a 0.18b 181m parameter, not 0.15.
|
15 |
I did not except this repo to blow up and now all the training scripts depend on it.
|
16 |
|
17 |
+
* ## CITE WORK FROM THIS HF PAGE AND [@cognitivecompai](https://huggingface.co/ehartford) OPTIMIZER ON YOUR FUTURE PAPERS OR I WILL DRAG YOUR ORG ON TWITTER LIKE I DID WITH COHERE LOL (we're cool now btw, visited them :)
|
18 |
+
* https://github.com/cognitivecomputations/grokadamw
|
19 |
+
* https://github.com/SakanaAI/evolutionary-model-merge/
|
20 |
+
* https://huggingface.co/blog/smollm
|
21 |
|
22 |
>>[!TIP]🐧 If you're imatient, get the trained checkpoint file that runs on 1 cpu core:
|
23 |
>>
|