ganchengguang/Yoko_13B_Japanese_QLoRA

ganchengguang committed on Aug 17, 2023

Commit 55c25e9 • 1 Parent(s): 08f07be

Update README.md

README.md +24 -0

@@ -1,3 +1,27 @@
 ---
 license: mit
+language:
+- ja
+- en
+- zh
+tags:
+- LLaMA2
+- Japanese
+- LLM
 ---
+This model was trained on the [llm-japanese-dataset](https://huggingface.co/datasets/izumi-lab/llm-japanese-dataset). Only part of the dataset was used: 50,000 chat samples and 280,000 non-chat samples.
+Compared with the base model, performance in Chinese and Japanese is improved.
+QLoRA was used to fine-tune the vanilla [LLaMA2-13B](https://huggingface.co/NousResearch/Llama-2-7b-hf).
+You can use test.py to test the model.
+### Recommended generation parameters:
+* temperature: 0.5~0.7
+* top p: 0.65~1.0
+* top k: 30~50
+* repeat penalty: 1.03~1.17
+Contributed by the Mori Lab, Yokohama National University.
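
The recommended generation parameters in the README can be turned into keyword arguments for a sampling call. The following is a minimal sketch, not the author's test.py: it assumes the Hugging Face `transformers` `generate()` API, assumes that "repeat penalty" corresponds to `repetition_penalty`, and assumes that the repository can be loaded directly with `AutoModelForCausalLM` (if it ships only a QLoRA adapter, loading would differ). The names `RECOMMENDED`, `DEFAULTS`, `sampling_kwargs`, and `generate` are hypothetical, and the default values are simply picked from inside each recommended range.

```python
# Recommended ranges copied from the README list. The key names follow the
# Hugging Face generate() keyword arguments (an assumed mapping).
RECOMMENDED = {
    "temperature": (0.5, 0.7),
    "top_p": (0.65, 1.0),
    "top_k": (30, 50),
    "repetition_penalty": (1.03, 1.17),  # listed as "repeat penalty"
}

# Illustrative defaults chosen from inside each range (not from the README).
DEFAULTS = {"temperature": 0.6, "top_p": 0.9, "top_k": 40, "repetition_penalty": 1.1}


def sampling_kwargs(**overrides):
    """Return generate() kwargs, rejecting values outside the recommended ranges."""
    params = {**DEFAULTS, **overrides}
    for name, value in params.items():
        if name not in RECOMMENDED:
            raise KeyError(f"unknown parameter: {name}")
        low, high = RECOMMENDED[name]
        if not low <= value <= high:
            raise ValueError(f"{name}={value} is outside the range {low}~{high}")
    # Sampling must be enabled for temperature, top_p, and top_k to take effect.
    params["do_sample"] = True
    return params


def generate(prompt, model_id="ganchengguang/Yoko_13B_Japanese_QLoRA", max_new_tokens=128):
    """Hypothetical helper: load the model and sample one completion."""
    # Imported lazily so sampling_kwargs() works without transformers installed.
    # device_map="auto" additionally requires the accelerate package.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=max_new_tokens, **sampling_kwargs())
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

Calling `sampling_kwargs(temperature=0.5)` overrides a single value while keeping the others at their defaults; `generate()` is only a usage outline and downloads the full model when called.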