wenge-research
/

yayi2-30b

Text Generation

Transformers

PyTorch

yayi

custom_code

Model card Files Files and versions Community

wenge-research commited on Dec 15, 2023

Commit

0625bed

•

1 Parent(s): c9b8c63

Update README.md

Browse files

Files changed (1) hide show

README.md +8 -8

README.md CHANGED Viewed

@@ -16,15 +16,15 @@ license: other
 ## 介绍/Introduction
-YAYI 2 是中科闻歌研发的开源大语言模型，包括 Base 和 Chat 版本，参数规模为 30B。YAYI2-30B 是基于 Transformer 的大语言模型，采用了 2.65 万亿 Tokens 的高质量、多语言语料进行预训练。针对通用和特定领域的应用场景，我们采用了百万级指令进行微调，同时借助人类反馈强化学习方法，以更好地使模型与人类价值观对齐。本次开源的模型为 YAYI2-30B Base 模型。
-如果您想了解更多关于 YAYI 2 模型的细节，我们建议您参阅 [GitHub](https://github.com/wenge-research/YAYI2) 仓库。更多技术细节，敬请期待我们的技术报告🔥。
-YAYI 2 is a collection of open-source large language models launched by Wenge Technology. YAYI2-30B is a Transformer-based large language model, and has been pretrained for 2.65 trillion tokens of multilingual data with high quality. The base model is aligned with human values through supervised fine-tuning with millions of instructions and reinforcement learning from human feedback (RLHF). We opensource the pre-trained language model in this release, namely **YAYI2-30B**.
-For more details about the YAYI 2, please refer to our GitHub repository. Stay tuned for more technical details in our upcoming technical report! 🔥
 ## 模型细节/Model Details
@@ -42,7 +42,7 @@ For more details about the YAYI 2, please refer to our GitHub repository. Stay t
 * python 3.8及以上版本
 * pytorch 2.0.1 及以上版本
-* 建议使用 CUDA 11.7 及以上
 * 运行 BF16 或 FP16 模型需要至少80GB显存（例如1xA100）
@@ -222,7 +222,7 @@ We evaluate our model on standard benchmarks, including C-Eval, MMLU, CMMLU, AGI
   <tr>
         <td><strong>YAYI2-30B</strong></td>
         <td style="text-align: center;">80.9</td>
-        <td style="text-align: center;">80.5</td>
         <td style="text-align: center;"><b>62.0</b></td>
         <td style="text-align: center;"><b>84.0</b></td>
         <td style="text-align: center;">64.4</td>
@@ -243,7 +243,7 @@ We evaluate our model using the source code from the [OpenCompass Github reposit
 ## 协议/License
-本项目中的代码依照 [Apache-2.0](LICENSE) 协议开源，社区使用 YAYI 2 模型和数据需要遵循[雅意YAYI 2 模型社区许可协议](YAYI2_Community_License)。若您需要将雅意 YAYI 2系列模型或其衍生品用作商业用途，请根据[《雅意 YAYI 2 模型商用许可协议》](YAYI2_Commercial_License)将商用许可申请登记信息发送至指定邮箱[email protected]。审核通过后，雅意将授予您商用版权许可，请遵循协议中的商业许可限制。
 The code in this project is open-sourced under the [Apache-2.0](LICENSE) license. The use of YaYi series model weights and data must adhere to the [YAYI 2 Community License](YAYI2_Community_License). If you intend to use the YAYI 2 series models or their derivatives for commercial purposes, please submit your commercial license application and registration information to [email protected], following the [YAYI 2 Commercial License](YAYI2_Commercial_License). Upon approval, YAYI will grant you a commercial copyright license, subject to the commercial license restrictions outlined in the agreement.
@@ -257,7 +257,7 @@ If you are using the resource for your work, please cite our paper.
 ```
 @article{YAYI 2,
-  author    = {Yin Luo, Qingchao Kong, Nan Xu, et.al.}},
   title     = {YAYI 2: Multilingual Open Source Large Language Models},
   journal   = {arXiv preprint arXiv},
   year      = {2023}

 ## 介绍/Introduction
+YAYI 2 是中科闻歌研发的开源大语言模型，包括 Base 和 Chat 版本，参数规模为 30B。YAYI2-30B 是基于 Transformer 的大语言模型，采用了 2.65 万亿 Tokens 的高质量、多语言语料进行预训练。针对通用和特定领域的应用场景，我们采用了百万级指令进行微调，同时借助人类反馈强化学习方法，以更好地使模型与人类价值观对齐。
+本次开源的模型为 YAYI2-30B Base 模型。如果您想了解更多关于 YAYI 2 模型的细节，我们建议您参阅 [GitHub](https://github.com/wenge-research/YAYI2) 仓库。更多技术细节，敬请期待我们的技术报告🔥。
+YAYI 2 is a collection of open-source large language models launched by Wenge Technology. YAYI2-30B is a Transformer-based large language model, and has been pretrained for 2.65 trillion tokens of multilingual data with high quality. The base model is aligned with human values through supervised fine-tuning with millions of instructions and reinforcement learning from human feedback (RLHF).
+We opensource the pre-trained language model in this release, namely **YAYI2-30B**. For more details about the YAYI 2, please refer to our  [GitHub](https://github.com/wenge-research/YAYI2)  repository. Stay tuned for more technical details in our upcoming technical report! 🔥
 ## 模型细节/Model Details
 * python 3.8及以上版本
 * pytorch 2.0.1 及以上版本
+* 建议使用 CUDA 11.7 及以上版本
 * 运行 BF16 或 FP16 模型需要至少80GB显存（例如1xA100）
   <tr>
         <td><strong>YAYI2-30B</strong></td>
         <td style="text-align: center;">80.9</td>
+        <td style="text-align: center;"><b>80.5</b></td>
         <td style="text-align: center;"><b>62.0</b></td>
         <td style="text-align: center;"><b>84.0</b></td>
         <td style="text-align: center;">64.4</td>
 ## 协议/License
+本项目中的代码依照 [Apache-2.0](LICENSE) 协议开源，社区使用 YAYI 2 模型和数据需要遵循[雅意YAYI 2 模型社区许可协议](YAYI2_Community_License)。若您需要将雅意 YAYI 2系列模型或其衍生品用作商业用途，请根据[《雅意 YAYI 2 模型商用许可协议》](YAYI2_Commercial_License)将商用许可申请登记信息发送至指定邮箱 [email protected]。审核通过后，雅意将授予您商用版权许可，请遵循协议中的商业许可限制。
 The code in this project is open-sourced under the [Apache-2.0](LICENSE) license. The use of YaYi series model weights and data must adhere to the [YAYI 2 Community License](YAYI2_Community_License). If you intend to use the YAYI 2 series models or their derivatives for commercial purposes, please submit your commercial license application and registration information to [email protected], following the [YAYI 2 Commercial License](YAYI2_Commercial_License). Upon approval, YAYI will grant you a commercial copyright license, subject to the commercial license restrictions outlined in the agreement.
 ```
 @article{YAYI 2,
+  author    = {Yin Luo, Qingchao Kong, Nan Xu, et.al.},
   title     = {YAYI 2: Multilingual Open Source Large Language Models},
   journal   = {arXiv preprint arXiv},
   year      = {2023}