Update README.md
README.md
CHANGED
@@ -117,7 +117,8 @@ The models have been pre-trained using a blend of the following datasets.
 |Codes|[The Stack](https://huggingface.co/datasets/bigcode/the-stack)|10B|
 
 The pre-training was continuously conducted using a total of 10 folds of non-overlapping data, each consisting of approximately 27-28B tokens.
-We finalized the pre-training with additional (potentially) high-quality 27B tokens data obtained from the identical source
+We finalized the pre-training with additional (potentially) high-quality 27B tokens data obtained from the identical source datasets listed above used for the 10-fold data.
+
 
 ### Instruction tuning
 
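For context on the changed lines: the README describes pre-training over 10 non-overlapping folds of roughly 27-28B tokens each. Below is a minimal sketch of how such a non-overlapping fold assignment might be produced. The fold count and per-fold token budget come from the README text; the function name `assign_folds`, the document count, and the random-assignment scheme are assumptions for illustration, not the authors' actual pipeline.

```python
# Hypothetical sketch: assign each document to exactly one of 10 folds,
# guaranteeing the folds are non-overlapping (no document appears twice).
# NUM_FOLDS and the per-fold token budget follow the README excerpt above;
# everything else is assumed for illustration.

import numpy as np

NUM_FOLDS = 10
TOKENS_PER_FOLD = 27_500_000_000  # README: approximately 27-28B tokens per fold


def assign_folds(num_documents: int, seed: int = 0) -> np.ndarray:
    """Randomly assign each document index to one of NUM_FOLDS folds.

    Non-overlap holds by construction: each document receives exactly
    one fold id, so no fold shares documents with another.
    """
    rng = np.random.default_rng(seed)
    return rng.integers(0, NUM_FOLDS, size=num_documents)


if __name__ == "__main__":
    fold_ids = assign_folds(num_documents=1_000_000)
    for fold in range(NUM_FOLDS):
        count = int((fold_ids == fold).sum())
        print(f"fold {fold}: {count} documents")
```

Under this scheme, the final (potentially) higher-quality 27B-token stage mentioned in the added line would simply be an eleventh partition drawn from the same source datasets after quality filtering.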