GeneZC commited on
Commit
2061213
1 Parent(s): f9c59fd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -10
README.md CHANGED
@@ -45,17 +45,24 @@ Surpassing Vicuna-7B and approximating LLaMA-2-Chat-7B on MT-Bench.
45
 
46
  **Instruction-following Benchmarks**
47
 
48
- |Method|AlpacaEval|MT-Bench|
49
- |--|--|--|
50
- |GPT-4|95.28|9.18|
51
- |Zephyr-7B-Beta|90.60|7.34|
52
- |Phi-2-DPO|81.37|-|
53
- |StableLM Zephyr 3B|76.00|6.64|
54
- |Vicuna-7B|76.84|6.17|
55
- |LLaMA-2-Chat-7B|71.37|6.27|
 
 
 
56
  ||
57
- |MiniChat-3B|48.82|-|
58
- |MiniChat-2-3B|77.30|6.23|
 
 
 
 
59
 
60
  The following is an example code snippet to use MiniChat-2-3B:
61
 
 
45
 
46
  **Instruction-following Benchmarks**
47
 
48
+ |Method|AlpacaEval|MT-Bench|MT-Bench-ZH|
49
+ |--|--|--|--|
50
+ |GPT-4|95.28|9.18|8.96|
51
+ |Zephyr-7B-Beta|90.60|7.34|6.27<sup>#</sup>|
52
+ |Vicuna-7B|76.84|6.17|5.22<sup>#</sup>|
53
+ |LLaMA-2-Chat-7B|71.37|6.27|5.43<sup>#</sup>|
54
+ |Qwen-Chat-7B|-|-|6.24|
55
+ |Phi-2-DPO|81.37|-|1.59<sup>#</sup><sup>$</sup>|
56
+ |StableLM-Zephyr-3B|76.00|6.64|4.31<sup>#</sup>|
57
+ |Rocket-3B|79.75|6.56|4.07<sup>#</sup>|
58
+ |Qwen-Chat-1.8B|-|-|5.65|
59
  ||
60
+ |MiniChat-3B|48.82|-|-|
61
+ |MiniChat-2-3B|77.30|6.23|6.04|
62
+
63
+ <sup>#</sup> specialized mainly for English.
64
+
65
+ <sup>$</sup> finetuned without multi-turn instruction data.
66
 
67
  The following is an example code snippet to use MiniChat-2-3B:
68