myeongho-jeong committed
Commit 8bc753a • Parent(s): 498bf08
Update README.md

README.md CHANGED
@@ -4,11 +4,11 @@ base_model: upstage/SOLAR-10.7B-v1.0
 tags:
 - generated_from_trainer
 model-index:
-- name: yanolja/
+- name: yanolja/EEVE-Korean-10.8B-v1.0
   results: []
 ---
 [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
-#
+# EEVE-Korean-10.8B-v1.0
 
 ## Join Our Community on Discord!
 
@@ -52,6 +52,8 @@ Our strategy involved a selective freeze of model parameters. Specifically, we k
 
 As a result, we froze the internal layers and the first 32,000 `embed_tokens`, directing our training efforts on a rich mix of Korean and multi-lingual corpora. This balanced approach has notably improved the model’s proficiency in Korean, without compromising its original language capabilities.
 
+For details, please refer to our technical report: [Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models](https://arxiv.org).
+
 ### Usage and Limitations
 
 Keep in mind that this model hasn't been fine-tuned with instruction-based training. While it excels in Korean language tasks, we advise careful consideration and further training for specific applications.
@@ -86,11 +88,15 @@ Our model’s training was comprehensive and diverse:
 
 This rigorous approach ensured a comprehensive and contextually rich Korean vocabulary for the model.
 
-
-
-
-
-
-
-
+## Citation
+
+```
+@misc{cui2023ultrafeedback,
+      title={UltraFeedback: Boosting Language Models with High-quality Feedback},
+      author={Ganqu Cui and Lifan Yuan and Ning Ding and Guanming Yao and Wei Zhu and Yuan Ni and Guotong Xie and Zhiyuan Liu and Maosong Sun},
+      year={2023},
+      eprint={2310.01377},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```
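
The second hunk above describes the selective freeze: the internal layers and the first 32,000 rows of `embed_tokens` stay fixed, so training only updates the newly added vocabulary embeddings. Below is a minimal, illustrative PyTorch/transformers sketch of how such a selective freeze could be wired up. It is not the training code behind this commit: the expanded vocabulary size (`NEW_VOCAB_SIZE`), the use of gradient hooks, and the inclusion of the output embeddings are all assumptions.

```python
# Illustrative sketch only -- not the actual EEVE training setup.
# Freezes the internal layers and the first 32,000 rows of `embed_tokens`,
# so that only newly added token embeddings receive gradient updates.

import torch
from transformers import AutoModelForCausalLM

NUM_FROZEN_TOKENS = 32_000   # original vocabulary size kept frozen (per the README)
NEW_VOCAB_SIZE = 40_960      # assumed expanded vocabulary size -- placeholder value

model = AutoModelForCausalLM.from_pretrained(
    "upstage/SOLAR-10.7B-v1.0", torch_dtype=torch.bfloat16
)
model.resize_token_embeddings(NEW_VOCAB_SIZE)

# 1) Freeze everything, then re-enable only the embedding matrices.
#    (Training the output embeddings as well is an assumption here.)
for param in model.parameters():
    param.requires_grad = False
model.get_input_embeddings().weight.requires_grad = True
model.get_output_embeddings().weight.requires_grad = True

# 2) Zero gradients for the first 32,000 rows so only the new rows are trained.
def freeze_old_rows(grad: torch.Tensor) -> torch.Tensor:
    grad = grad.clone()
    grad[:NUM_FROZEN_TOKENS] = 0
    return grad

model.get_input_embeddings().weight.register_hook(freeze_old_rows)
model.get_output_embeddings().weight.register_hook(freeze_old_rows)
```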
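
Since the README notes the model has not been instruction-tuned, a plain text-continuation call is the natural way to try it. The sketch below uses the standard transformers generation API; the repository id comes from the diff above, and the Korean prompt is an arbitrary placeholder, not an example from the model card.

```python
# Minimal usage sketch, assuming the standard Hugging Face transformers API.
# The model is a pre-trained base model (not instruction-tuned), so treat the
# output as a continuation of the prompt rather than an answer to a question.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "yanolja/EEVE-Korean-10.8B-v1.0"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

prompt = "한국의 수도는"  # "The capital of Korea is" -- placeholder prompt
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=64, do_sample=False)

print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

As the README's "Usage and Limitations" section advises, further fine-tuning (for example, instruction tuning) is recommended before using the model for specific downstream applications.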