SO0529 committed
Commit 7bea384
Parent(s): 39f56ae
modify: Readme contents
README.md CHANGED
@@ -76,7 +76,7 @@ for gen_text in tokenizer.batch_decode(gen_tokens, skip_special_tokens=True):
 The model was trained on [Japanese CC-100](http://data.statmt.org/cc-100/ja.txt.xz), [Japanese Wikipedia](https://dumps.wikimedia.org/other/cirrussearch), and [Japanese OSCAR](https://huggingface.co/datasets/oscar).
 
 # Tokenization
-The model uses a [special sub-word tokenizer](https://github.com/tanreinama/Japanese-BPEEncoder_V2). Please refer the original repository or [GPT-
+The model uses a [special sub-word tokenizer](https://github.com/tanreinama/Japanese-BPEEncoder_V2). Please refer the original repository or [GPT-NoeX-Japanese](https://huggingface.co/docs/transformers/model_doc/gpt_neox_japanese) in detail.
 
 # Licenese
 [The MIT license](https://opensource.org/licenses/MIT)
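For context, a minimal sketch of how the tokenizer referenced in the changed line can be used through the `GPTNeoXJapanese` classes that the newly added link documents. The Hub model id below is a placeholder assumption; the commit itself does not name the repository:

```python
# Minimal usage sketch, not part of the commit. Assumes the model is
# published on the Hugging Face Hub; the id below is hypothetical.
from transformers import GPTNeoXJapaneseForCausalLM, GPTNeoXJapaneseTokenizer

model_id = "your-org/gpt-neox-japanese-model"  # hypothetical id, replace with the real repo

# The tokenizer class wraps the special sub-word tokenizer
# (Japanese-BPEEncoder_V2) that the README links to.
tokenizer = GPTNeoXJapaneseTokenizer.from_pretrained(model_id)
model = GPTNeoXJapaneseForCausalLM.from_pretrained(model_id)

# Encode a short Japanese prompt, generate, then decode, mirroring the
# batch_decode call visible in the hunk context above.
input_ids = tokenizer("人とAIが協調するためには、", return_tensors="pt").input_ids
gen_tokens = model.generate(input_ids, do_sample=True, max_new_tokens=50)
for gen_text in tokenizer.batch_decode(gen_tokens, skip_special_tokens=True):
    print(gen_text)
```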