wenge-research commited on
Commit
0625bed
1 Parent(s): c9b8c63

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -16,15 +16,15 @@ license: other
16
 
17
 
18
  ## 介绍/Introduction
19
- YAYI 2 是中科闻歌研发的开源大语言模型,包括 Base 和 Chat 版本,参数规模为 30B。YAYI2-30B 是基于 Transformer 的大语言模型,采用了 2.65 万亿 Tokens 的高质量、多语言语料进行预训练。针对通用和特定领域的应用场景,我们采用了百万级指令进行微调,同时借助人类反馈强化学习方法,以更好地使模型与人类价值观对齐。本次开源的模型为 YAYI2-30B Base 模型。
20
 
21
- 如果您想了解更多关于 YAYI 2 模型的细节,我们建议您参阅 [GitHub](https://github.com/wenge-research/YAYI2) 仓库。更多技术细节,敬请期待我们的技术报告🔥。
22
 
23
 
24
 
25
- YAYI 2 is a collection of open-source large language models launched by Wenge Technology. YAYI2-30B is a Transformer-based large language model, and has been pretrained for 2.65 trillion tokens of multilingual data with high quality. The base model is aligned with human values through supervised fine-tuning with millions of instructions and reinforcement learning from human feedback (RLHF). We opensource the pre-trained language model in this release, namely **YAYI2-30B**.
26
 
27
- For more details about the YAYI 2, please refer to our GitHub repository. Stay tuned for more technical details in our upcoming technical report! 🔥
28
 
29
 
30
  ## 模型细节/Model Details
@@ -42,7 +42,7 @@ For more details about the YAYI 2, please refer to our GitHub repository. Stay t
42
 
43
  * python 3.8及以上版本
44
  * pytorch 2.0.1 及以上版本
45
- * 建议使用 CUDA 11.7 及以上
46
  * 运行 BF16 或 FP16 模型需要至少80GB显存(例如1xA100)
47
 
48
 
@@ -222,7 +222,7 @@ We evaluate our model on standard benchmarks, including C-Eval, MMLU, CMMLU, AGI
222
  <tr>
223
  <td><strong>YAYI2-30B</strong></td>
224
  <td style="text-align: center;">80.9</td>
225
- <td style="text-align: center;">80.5</td>
226
  <td style="text-align: center;"><b>62.0</b></td>
227
  <td style="text-align: center;"><b>84.0</b></td>
228
  <td style="text-align: center;">64.4</td>
@@ -243,7 +243,7 @@ We evaluate our model using the source code from the [OpenCompass Github reposit
243
 
244
  ## 协议/License
245
 
246
- 本项目中的代码依照 [Apache-2.0](LICENSE) 协议开源,社区使用 YAYI 2 模型和数据需要遵循[雅意YAYI 2 模型社区许可协议](YAYI2_Community_License)。若您需要将雅意 YAYI 2系列模型或其衍生品用作商业用途,请根据[《雅意 YAYI 2 模型商用许可协议》](YAYI2_Commercial_License)将商用许可申请登记信息发送至指定邮箱[email protected]。审核通过后,雅意将授予您商用版权许可,请遵循协议中的商业许可限制。
247
 
248
  The code in this project is open-sourced under the [Apache-2.0](LICENSE) license. The use of YaYi series model weights and data must adhere to the [YAYI 2 Community License](YAYI2_Community_License). If you intend to use the YAYI 2 series models or their derivatives for commercial purposes, please submit your commercial license application and registration information to [email protected], following the [YAYI 2 Commercial License](YAYI2_Commercial_License). Upon approval, YAYI will grant you a commercial copyright license, subject to the commercial license restrictions outlined in the agreement.
249
 
@@ -257,7 +257,7 @@ If you are using the resource for your work, please cite our paper.
257
 
258
  ```
259
  @article{YAYI 2,
260
- author = {Yin Luo, Qingchao Kong, Nan Xu, et.al.}},
261
  title = {YAYI 2: Multilingual Open Source Large Language Models},
262
  journal = {arXiv preprint arXiv},
263
  year = {2023}
 
16
 
17
 
18
  ## 介绍/Introduction
19
+ YAYI 2 是中科闻歌研发的开源大语言模型,包括 Base 和 Chat 版本,参数规模为 30B。YAYI2-30B 是基于 Transformer 的大语言模型,采用了 2.65 万亿 Tokens 的高质量、多语言语料进行预训练。针对通用和特定领域的应用场景,我们采用了百万级指令进行微调,同时借助人类反馈强化学习方法,以更好地使模型与人类价值观对齐。
20
 
21
+ 本次开源的模型为 YAYI2-30B Base 模型。如果您想了解更多关于 YAYI 2 模型的细节,我们建议您参阅 [GitHub](https://github.com/wenge-research/YAYI2) 仓库。更多技术细节,敬请期待我们的技术报告🔥。
22
 
23
 
24
 
25
+ YAYI 2 is a collection of open-source large language models launched by Wenge Technology. YAYI2-30B is a Transformer-based large language model, and has been pretrained for 2.65 trillion tokens of multilingual data with high quality. The base model is aligned with human values through supervised fine-tuning with millions of instructions and reinforcement learning from human feedback (RLHF).
26
 
27
+ We opensource the pre-trained language model in this release, namely **YAYI2-30B**. For more details about the YAYI 2, please refer to our [GitHub](https://github.com/wenge-research/YAYI2) repository. Stay tuned for more technical details in our upcoming technical report! 🔥
28
 
29
 
30
  ## 模型细节/Model Details
 
42
 
43
  * python 3.8及以上版本
44
  * pytorch 2.0.1 及以上版本
45
+ * 建议使用 CUDA 11.7 及以上版本
46
  * 运行 BF16 或 FP16 模型需要至少80GB显存(例如1xA100)
47
 
48
 
 
222
  <tr>
223
  <td><strong>YAYI2-30B</strong></td>
224
  <td style="text-align: center;">80.9</td>
225
+ <td style="text-align: center;"><b>80.5</b></td>
226
  <td style="text-align: center;"><b>62.0</b></td>
227
  <td style="text-align: center;"><b>84.0</b></td>
228
  <td style="text-align: center;">64.4</td>
 
243
 
244
  ## 协议/License
245
 
246
+ 本项目中的代码依照 [Apache-2.0](LICENSE) 协议开源,社区使用 YAYI 2 模型和数据需要遵循[雅意YAYI 2 模型社区许可协议](YAYI2_Community_License)。若您需要将雅意 YAYI 2系列模型或其衍生品用作商业用途,请根据[《雅意 YAYI 2 模型商用许可协议》](YAYI2_Commercial_License)将商用许可申请登记信息发送至指定邮箱 [email protected]。审核通过后,雅意将授予您商用版权许可,请遵循协议中的商业许可限制。
247
 
248
  The code in this project is open-sourced under the [Apache-2.0](LICENSE) license. The use of YaYi series model weights and data must adhere to the [YAYI 2 Community License](YAYI2_Community_License). If you intend to use the YAYI 2 series models or their derivatives for commercial purposes, please submit your commercial license application and registration information to [email protected], following the [YAYI 2 Commercial License](YAYI2_Commercial_License). Upon approval, YAYI will grant you a commercial copyright license, subject to the commercial license restrictions outlined in the agreement.
249
 
 
257
 
258
  ```
259
  @article{YAYI 2,
260
+ author = {Yin Luo, Qingchao Kong, Nan Xu, et.al.},
261
  title = {YAYI 2: Multilingual Open Source Large Language Models},
262
  journal = {arXiv preprint arXiv},
263
  year = {2023}