lqtrung1998 commited on
Commit
75d5227
1 Parent(s): 9c35d46

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -7
README.md CHANGED
@@ -18,13 +18,6 @@ This repository contains:
18
 
19
  Note: Our models are tuned based on Codellama, thus, licenses applicable to Codellama, such as [Llama license](https://ai.meta.com/resources/models-and-libraries/llama-downloads/), also hold on these models
20
 
21
- | | Top-1 | Voting@100 | Rerank@100 |
22
- |--------------------------------------------------------------------|:------:|:----------:|:----------:|
23
- | Codellama-7b-hf-SFT-warmup-GSM8k | 63.00 | - | - |
24
- | Codellama-7b-hf-SFT-GSM8k<br>(+Codellama-7b-hf-SFT-Rerank-GSM8k) | 63.68 | 68.0 | 77.0 |
25
- | Codellama-7b-hf-ReFT-GSM8k<br>(+Codellama-7b-hf-ReFT-Rerank-GSM8k) | 75.28 | 78.0 | 81.2 |
26
-
27
-
28
  ## Training Data
29
  The model is trained on GSM8k data with Python SDP CoT format, which can be found [here](https://github.com/lqtrung1998/mwp_ReFT)
30
 
@@ -38,6 +31,14 @@ Rerank model is trained to classify if the output CoT is correct or not using sa
38
  ## Evaluation Results
39
  See evaluations results of the models at table 4 of the research paper.
40
 
 
 
 
 
 
 
 
 
41
  ## Usage
42
  You can use the models through Huggingface's Transformers library or follow scripts in our repo.
43
 
 
18
 
19
  Note: Our models are tuned based on Codellama, thus, licenses applicable to Codellama, such as [Llama license](https://ai.meta.com/resources/models-and-libraries/llama-downloads/), also hold on these models
20
 
 
 
 
 
 
 
 
21
  ## Training Data
22
  The model is trained on GSM8k data with Python SDP CoT format, which can be found [here](https://github.com/lqtrung1998/mwp_ReFT)
23
 
 
31
  ## Evaluation Results
32
  See evaluations results of the models at table 4 of the research paper.
33
 
34
+ Updated results:
35
+ | | Top-1 | Voting@100 | Rerank@100 |
36
+ |--------------------------------------------------------------------|:------:|:----------:|:----------:|
37
+ | Codellama-7b-hf-SFT-warmup-GSM8k | 63.00 | - | - |
38
+ | Codellama-7b-hf-SFT-GSM8k<br>(+Codellama-7b-hf-SFT-Rerank-GSM8k) | 63.68 | 68.0 | 77.0 |
39
+ | Codellama-7b-hf-ReFT-GSM8k<br>(+Codellama-7b-hf-ReFT-Rerank-GSM8k) | 75.28 | 78.0 | 81.2 |
40
+
41
+
42
  ## Usage
43
  You can use the models through Huggingface's Transformers library or follow scripts in our repo.
44