nikitastheo committed
Commit aff0e4d
Parent(s): aa68d5f
Update README.md
README.md
CHANGED
@@ -9,6 +9,6 @@ This model uses the LTG-BERT architecture.
 The model was trained on a combination of the BabyLM Dataset, the TinyStories Dataset, and generated data,
 in accordance with the rules of the Strict track, and the 100M word budget.
 
-The
+The model was trained with 128 token sequence length
 
 Hyperparameters used and evaluation scores will follow in a subsequent update.