wenge-research/yayi2-30b

wenge-research committed on Dec 15, 2023

Commit 403cc63 (1 parent: 321524b)

Update README.md

Files changed (1)

README.md +8 -5

@@ -18,7 +18,7 @@ license: other
 ## Introduction
 YAYI 2 is an open-source large language model developed by Wenge Technology (中科闻歌), available in Base and Chat versions with 30B parameters. YAYI2-30B is a Transformer-based large language model pretrained on a high-quality, multilingual corpus of 2.65 trillion tokens. For general and domain-specific applications, we fine-tuned it on millions of instructions and applied reinforcement learning from human feedback to better align the model with human values. The model released here is the YAYI2-30B Base model.
-If you would like more details about the YAYI 2 model, we recommend consulting the [GitHub](https://github.com/wenge-research/YAYI2) repository. For further technical details, stay tuned for our our technical report🔥.
+If you would like more details about the YAYI 2 model, we recommend consulting the [GitHub](https://github.com/wenge-research/YAYI2) repository. For further technical details, stay tuned for our technical report🔥.
@@ -29,10 +29,13 @@ For more details about the YAYI 2, please refer to our GitHub repository. Stay t
 ## Model Details
-| Model Name | Context Length | 🤗 HF Model Name |
-|:----------|:----------:|:----------:|
-| YAYI2-30B | 4096 | wenge-research/yayi2-30b |
+| Hyperparameter | Value |
+|:----------|:----------:|
+| n_layers | 64 |
+| n_heads | 64 |
+| d_model | 16384 |
+| vocab size | 81920 |
+| sequence length | 4096 |
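
The hyperparameter rows added in this commit can be captured as a small config object. The sketch below is illustrative only: the class name `Yayi2Hyperparameters` and its helper methods are hypothetical and not part of the YAYI 2 code base, the values are copied as listed in the table above, and the per-head dimension assumes the common convention that `d_model` is split evenly across `n_heads`, which the README does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Yayi2Hyperparameters:
    """Hypothetical container; values copied from the table in this commit."""

    n_layers: int = 64
    n_heads: int = 64
    d_model: int = 16384
    vocab_size: int = 81920
    sequence_length: int = 4096

    def head_dim(self) -> int:
        # Assumption: d_model is divided evenly among the attention heads.
        if self.d_model % self.n_heads != 0:
            raise ValueError("d_model must be divisible by n_heads")
        return self.d_model // self.n_heads

    def embedding_parameters(self) -> int:
        # Size of a single token-embedding matrix: vocab_size x d_model.
        return self.vocab_size * self.d_model


hparams = Yayi2Hyperparameters()
print("head_dim:", hparams.head_dim())
print("embedding parameters:", hparams.embedding_parameters())
```

A frozen dataclass keeps the values immutable, so a typo in one field surfaces as an explicit error in `head_dim` rather than a silently changed configuration.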