tokyotech-llm
/

Llama-3-Swallow-70B-Instruct-v0.1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Taishi-N324 commited on Aug 17

Commit

1bda1f0

•

1 Parent(s): 6e5d6c5

Upload README.md

Files changed (1) hide show

README.md +27 -8

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ We are excited to share the release schedule for our latest models:
 ## Swallow Model Index
-|Model|Llama-3-Swallow|Llama3 Swallow Instruct|
 |---|---|---|
 |8B| [Link](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-8B-v0.1) | [Link](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-8B-Instruct-v0.1) |
 |70B| [Link](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-70B-v0.1) | [Link](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-70B-Instruct-v0.1) |
@@ -205,14 +205,33 @@ Here are the team members:
 If you find our work helpful, please feel free to cite us.
-```tex
-@misc{llama3swallow,
-      title={Llama 3 Swallow},
-      url={https://swallow-llm.github.io/llama3-swallow.en.html},
-      author={Swallow LLM},
-      year={2024},
-}
 ```
 ### Citations

 ## Swallow Model Index
+|Model|Llama-3-Swallow|Llama3 Swallow instruct|
 |---|---|---|
 |8B| [Link](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-8B-v0.1) | [Link](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-8B-Instruct-v0.1) |
 |70B| [Link](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-70B-v0.1) | [Link](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-70B-Instruct-v0.1) |
 If you find our work helpful, please feel free to cite us.
 ```
+@inproceedings{Fujii:COLM2024,
+   title={Continual Pre-Training for Cross-Lingual LLM Adaptation:
+Enhancing Japanese Language Capabilities},
+   author={Kazuki Fujii and Taishi Nakamura and Mengsay Loem and Hiroki
+Iida and Masanari Ohi and Kakeru Hattori and Hirai Shota and Sakae
+Mizuki and Rio Yokota and Naoaki Okazaki},
+   booktitle="Proceedings of the First Conference on Language Modeling",
+   series={COLM},
+   pages="(to appear)",
+   year="2024",
+   month=oct,
+   address={University of Pennsylvania, USA},
+}
+@inproceedings{Okazaki:COLM2024,
+   title={Building a Large Japanese Web Corpus for Large Language Models},
+   author={Naoaki Okazaki and Kakeru Hattori and Hirai Shota and Hiroki
+Iida and Masanari Ohi and Kazuki Fujii and Taishi Nakamura and Mengsay
+Loem and Rio Yokota and Sakae Mizuki},
+   booktitle="Proceedings of the First Conference on Language Modeling",
+   series={COLM},
+   pages="(to appear)",
+   year="2024",
+   month=oct,
+   address={University of Pennsylvania, USA},
+}
 ### Citations