Taishi-N324 committed
Commit: d736d04
Parent: 9629a9f

Upload README.md

Files changed (1)
  1. README.md +14 -2
README.md CHANGED
@@ -40,7 +40,7 @@ We are excited to share the release schedule for our latest models:
 ![logo](./logo.png)
 
 This repository provides large language models developed by [TokyoTech-LLM](https://tokyotech-llm.github.io/).
-Read our [blog post](https://zenn.dev/tokyotech_lm/articles/d6cb3a8fdfc907) or our [paper](https://www.anlp.jp/proceedings/annual_meeting/2024/pdf_dir/A8-5.pdf)
+Read our [blog post](https://zenn.dev/tokyotech_lm/articles/d6cb3a8fdfc907) or our [paper](https://arxiv.org/abs/2404.17790)
 
 ## Model Details
 
@@ -224,7 +224,7 @@ The following datasets were used for continual pre-training.
 
 - [Japanese Wikipedia](https://dumps.wikimedia.org/other/cirrussearch)
 - [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb)
-- Swallow Corpus
+- [Swallow Corpus](https://arxiv.org/abs/2404.17733)
 - [The Pile](https://huggingface.co/datasets/EleutherAI/pile)
 
 
@@ -265,3 +265,15 @@ Here are the team members:
 - [Rio Yokota](https://twitter.com/rioyokota)
 - [Kazuki Fujii](https://twitter.com/okoge_kaz)
 - [Taishi Nakamura](https://twitter.com/Setuna7777_2)
+
+## How to cite
+```
+@misc{fujii2024continual,
+      title={Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities},
+      author={Kazuki Fujii and Taishi Nakamura and Mengsay Loem and Hiroki Iida and Masanari Ohi and Kakeru Hattori and Hirai Shota and Sakae Mizuki and Rio Yokota and Naoaki Okazaki},
+      year={2024},
+      eprint={2404.17790},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL}
+}
+```
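
For reference, one of the continual pre-training corpora linked in the hunk above, RefinedWeb, is hosted on the Hugging Face Hub and can be streamed with the `datasets` library. This is a minimal sketch, not part of this commit: the repo id comes from the README link, and the `"content"` field name is assumed to be RefinedWeb's text column.

```python
# Minimal sketch (not part of this commit): stream a corpus listed in the README.
# Assumes the `datasets` library is installed; "content" is assumed to be the
# text field of tiiuae/falcon-refinedweb.
from datasets import load_dataset

# Streaming avoids downloading the full multi-terabyte corpus up front.
refinedweb = load_dataset("tiiuae/falcon-refinedweb", split="train", streaming=True)

for i, example in enumerate(refinedweb):
    print(example["content"][:200])  # first 200 characters of each document
    if i >= 2:
        break
```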