nisten commited on
Commit
1b4702e
1 Parent(s): f3eb791

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -1
README.md CHANGED
@@ -14,7 +14,10 @@ Use this frankenbase for training.
14
  Sorry for the mislabelling, the model is a 0.18b 181m parameter, not 0.15.
15
  I did not except this repo to blow up and now all the training scripts depend on it.
16
 
17
- * ## ACKOWLEDGE WORK FROM THIS HF PAGE AND [@cognitivecompai](https://github.com/cognitivecomputations/grokadamw) OPTIMIZER ON YOUR FUTURE PAPERS OR I WILL DRAG YOUR ORG ON TWITTER LIKE I DID WITH COHERE LOL (we're cool now btw, visited them :)
 
 
 
18
 
19
  >>[!TIP]🐧 If you're imatient, get the trained checkpoint file that runs on 1 cpu core:
20
  >>
 
14
  Sorry for the mislabelling, the model is a 0.18b 181m parameter, not 0.15.
15
  I did not except this repo to blow up and now all the training scripts depend on it.
16
 
17
+ * ## CITE WORK FROM THIS HF PAGE AND [@cognitivecompai](https://huggingface.co/ehartford) OPTIMIZER ON YOUR FUTURE PAPERS OR I WILL DRAG YOUR ORG ON TWITTER LIKE I DID WITH COHERE LOL (we're cool now btw, visited them :)
18
+ * https://github.com/cognitivecomputations/grokadamw
19
+ * https://github.com/SakanaAI/evolutionary-model-merge/
20
+ * https://huggingface.co/blog/smollm
21
 
22
  >>[!TIP]🐧 If you're imatient, get the trained checkpoint file that runs on 1 cpu core:
23
  >>