Update README.md
README.md
CHANGED
@@ -15,6 +15,20 @@ license: llama2
</p>


<font size=4>

| <sup>Model</sup> | <sup>Checkpoint</sup> | <sup>Paper</sup> |<sup>MT-Bench</sup> | <sup>AlpacaEval</sup> | <sup>GSM8k</sup> | <sup>HumanEval</sup> | <sup>License</sup>|
</p>


+## News
+
+- 🔥 🔥 🔥 [08/11/2023] We release **WizardMath** Models.
+- 🔥 Our **WizardMath-70B-V1.0** model slightly outperforms some closed-source LLMs on the GSM8K, including **ChatGPT 3.5**, **Claude Instant 1** and **PaLM 2 540B**.
+- 🔥 Our **WizardMath-70B-V1.0** model achieves **81.6 pass@1** on the [GSM8k Benchmarks](https://github.com/openai/grade-school-math), which is **24.8** points higher than the SOTA open-source LLM.
+- 🔥 Our **WizardMath-70B-V1.0** model achieves **22.7 pass@1** on the [MATH Benchmarks](https://github.com/hendrycks/math), which is **9.2** points higher than the SOTA open-source LLM.
+
+| Model | Checkpoint | Paper | GSM8k | MATH |Online Demo| License|
+| ----- |------| ---- |------|-------| ----- | ----- |
+| WizardMath-70B-V1.0 | 🤗 <a href="https://huggingface.co/WizardLM/WizardMath-70B-V1.0" target="_blank">HF Link</a> | 📃Coming Soon| **81.6** | **22.7** || <a href="https://ai.meta.com/resources/models-and-libraries/llama-downloads/" target="_blank">Llama 2 License </a> |
+| WizardMath-13B-V1.0 | 🤗 <a href="https://huggingface.co/WizardLM/WizardMath-13B-V1.0" target="_blank">HF Link</a> | 📃Coming Soon| **63.9** | **14.0** | | <a href="https://ai.meta.com/resources/models-and-libraries/llama-downloads/" target="_blank">Llama 2 License </a> |
+| WizardMath-7B-V1.0 | 🤗 <a href="https://huggingface.co/WizardLM/WizardMath-7B-V1.0" target="_blank">HF Link</a> | 📃Coming Soon| **54.9** | **10.7** | [Demo-1](http://777957f.r10.cpolar.top), [Demo-2](http://2be2671b.r10.cpolar.top) | <a href="https://ai.meta.com/resources/models-and-libraries/llama-downloads/" target="_blank">Llama 2 License </a>|
+
+
<font size=4>

| <sup>Model</sup> | <sup>Checkpoint</sup> | <sup>Paper</sup> |<sup>MT-Bench</sup> | <sup>AlpacaEval</sup> | <sup>GSM8k</sup> | <sup>HumanEval</sup> | <sup>License</sup>|