gair-prox/TinyLlama-1.1B-ProXMath


SinclairWang committed 20 days ago

Commit 007c15e, 1 parent: 56c42ee

Update README.md

Files changed (1)

README.md +39 -3


@@ -1,3 +1,39 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+datasets:
+- gair-prox/open-web-math-pro
+language:
+- en
+base_model:
+- TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T
+---
+# TinyLlama-1.1B-ProXMath
+<p align="center">
+  <img src="prox-teaser.png">
+</p>
+[ArXiv](http://arxiv.org/abs/xxxx) | [Data: OpenWebMath-Pro](https://huggingface.co/datasets/gair-prox/open-web-math-pro) | [Code](https://github.com/GAIR-NLP/program-every-example)
+**TinyLlama-1.1B-ProXMath** is a math-adapted TinyLlama-1.1B model, continually pre-trained for **15**B tokens on [OpenWebMath-Pro](https://huggingface.co/datasets/gair-prox/open-web-math-pro), a version of OpenWebMath refined by ProX.
+## Evaluations
+ProX models are evaluated on 9 common math reasoning benchmarks; the higher score in each column is shown in bold.
+| Model                   |   asdiv  |  gsm8k  |  mathqa  |   mawps  | minerva_math | mmlu_stem | sat_math |   svamp  |  tabmwp  |  average |
+|-------------------------|:--------:|:-------:|:--------:|:--------:|:------------:|:---------:|:--------:|:--------:|:--------:|:--------:|
+| TinyLlama-1.1B          |   18.0   |   2.8   |   14.6   |   20.2   |      3.2     |    16.3   |   21.9   |   10.9   |   12.5   |   13.4   |
+| TinyLlama-1.1B-ProXMath | **41.9** | **9.0** | **15.6** | **56.9** |    **5.6**   |  **26.8** | **31.2** | **23.8** | **22.2** | **25.7** |
+### Citation
+```
+@misc{TBD
+}
+```
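
The README added in this commit does not yet show how to load the model. The following is a minimal sketch, assuming the checkpoint is published under the repo id shown in the page header (`gair-prox/TinyLlama-1.1B-ProXMath`) and loads through the standard `transformers` causal-LM classes, as Llama-architecture checkpoints normally do. It needs `transformers` and PyTorch installed. The prompt format in `build_prompt` is hypothetical; the model card does not specify one.

```python
# Repo id taken from the page header; assumed to be the published checkpoint.
MODEL_ID = "gair-prox/TinyLlama-1.1B-ProXMath"


def build_prompt(question: str) -> str:
    """Build a plain completion-style prompt (hypothetical format, not from the card)."""
    return f"Question: {question.strip()}\nAnswer:"


def generate_answer(question: str, max_new_tokens: int = 128) -> str:
    """Download the model on first use and greedily decode a continuation."""
    # Imported lazily so build_prompt works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(build_prompt(question), return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=False,  # greedy decoding for reproducible output
    )
    # The decoded string contains the prompt followed by the continuation.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example (downloads the weights):
# print(generate_answer("What is 12 * 7?"))
```

Because this is a base model that was continually pre-trained rather than instruction-tuned, completion-style or few-shot prompts are likely a better fit than chat-style instructions.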