SinclairWang committed on
Commit
003bb45
1 Parent(s): 36f112e

Update README.md

Files changed (1)
  1. README.md +39 -3
README.md CHANGED
@@ -1,3 +1,39 @@
- ---
- license: llama2
- ---
+ ---
+ license: llama2
+ datasets:
+ - gair-prox/open-web-math-pro
+ language:
+ - en
+ pipeline_tag: text-generation
+ tags:
+ - llama2
+ - math
+ - reasoning
+ base_model:
+ - meta-llama/Llama-2-7b-hf
+ ---
+
+ # Llama-2-7B-ProXMath
+
+ <p align="center">
+ <img src="prox-teaser.png">
+ </p>
+
+ [ArXiv](http://arxiv.org/abs/xxxx) | [Data: OpenWebMath-Pro](https://huggingface.co/datasets/gair-prox/open-web-math-pro) | [Code](https://github.com/GAIR-NLP/program-every-example)
+
+ **Llama-2-7B-ProXMath** is a math-adapted Llama-2-7B model, continually pre-trained for **10**B tokens on [OpenWebMath-Pro](https://huggingface.co/datasets/gair-prox/open-web-math-pro), a version of OpenWebMath refined by ProX.
+
+ ## Evaluations
+
+ ProX models are evaluated on 9 common math reasoning benchmarks.
+
+ | Model               | asdiv | gsm8k | mathqa | mawps | minerva_math | mmlu_stem | sat_math | svamp | tabmwp | average |
+ |---------------------|:-----:|:-----:|:------:|:-----:|:------------:|:---------:|:--------:|:-----:|:------:|:-------:|
+ | Llama-2-7B          | 51.6  | 14.1  | 12.5   | 63.6  | 3.8          | 32.9      | 34.4     | 39.5  | 30.9   | 31.5    |
+ | Llama-2-7B-ProXMath | 63.7  | 30.6  | 40.1   | 79.3  | 16.8         | 43.8      | 53.1     | 50.2  | 37.3   | 46.1    |
+
+ ### Citation
+ ```
+ @misc{TBD
+ }
+ ```
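
The `average` column in the Evaluations table above appears to be the unweighted mean of the nine benchmark scores. The sketch below is not part of the commit. It recomputes that mean and the per-benchmark gain from the scores copied out of the table, assuming equal weighting of the benchmarks. The helper names `macro_average` and `per_benchmark_gain` are hypothetical and chosen only for illustration.

```python
# Scores copied from the Evaluations table in the README diff above.
BENCHMARKS = [
    "asdiv", "gsm8k", "mathqa", "mawps", "minerva_math",
    "mmlu_stem", "sat_math", "svamp", "tabmwp",
]
SCORES = {
    "Llama-2-7B": [51.6, 14.1, 12.5, 63.6, 3.8, 32.9, 34.4, 39.5, 30.9],
    "Llama-2-7B-ProXMath": [63.7, 30.6, 40.1, 79.3, 16.8, 43.8, 53.1, 50.2, 37.3],
}


def macro_average(scores):
    """Unweighted mean over benchmarks (assumed to match the 'average' column)."""
    return sum(scores) / len(scores)


def per_benchmark_gain(base, tuned):
    """Point difference (tuned minus base) for each benchmark, rounded to one decimal."""
    return {
        name: round(t - b, 1)
        for name, b, t in zip(BENCHMARKS, base, tuned)
    }


if __name__ == "__main__":
    for model, scores in SCORES.items():
        print(f"{model}: {macro_average(scores):.2f}")
    gains = per_benchmark_gain(SCORES["Llama-2-7B"], SCORES["Llama-2-7B-ProXMath"])
    for name, gain in gains.items():
        print(f"{name}: {gain:+.1f}")
```

Under this equal-weighting assumption the recomputed means agree with the reported averages to within rounding, which suggests the column is a plain mean rather than a weighted one.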