hiroshi-matsuda-rit/bert-base-japanese-basic-char-v2

hiroshi-matsuda-rit committed on Aug 4, 2021

Commit 6191180 (1 parent: f8b286f)

Update README.md

Files changed (1)

README.md +1 -1

@@ -8,4 +8,4 @@ datasets:
 # BERT base Japanese (character-level tokenization with whole word masking, jawiki-20200831)
 This pretrained model is almost the same as [cl-tohoku/bert-base-japanese-char-v2](https://huggingface.co/cl-tohoku/bert-base-japanese-char-v2) but do not need `fugashi` or `unidic_lite`.
-The only difference is in `word_tokenzer` property (specify `basic` instead of `mecab`) in `tokenizer_config.json`.
+The only difference is in `word_tokenzer_type` property (specify `basic` instead of `mecab`) in `tokenizer_config.json`.
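
The configuration change described in the README can be sketched as follows. This is a minimal illustration, not the repository's actual file: the contents of `UPSTREAM_CONFIG` are assumed for the example, and `use_basic_word_tokenizer` and `needs_mecab` are hypothetical helper names. The README spells the property `word_tokenzer_type`; the sketch uses `word_tokenizer_type`, which is the spelling of the corresponding `BertJapaneseTokenizer` argument in `transformers`.

```python
import json

# Assumed, illustrative contents of an upstream tokenizer_config.json.
# The real file may contain additional keys.
UPSTREAM_CONFIG = json.loads("""
{
  "do_lower_case": false,
  "word_tokenizer_type": "mecab",
  "subword_tokenizer_type": "character"
}
""")


def use_basic_word_tokenizer(config):
    """Return a copy of the config with the word tokenizer set to 'basic'."""
    patched = dict(config)
    patched["word_tokenizer_type"] = "basic"
    return patched


def needs_mecab(config):
    """True if the config selects the MeCab word tokenizer (which relies on fugashi)."""
    return config.get("word_tokenizer_type") == "mecab"


patched = use_basic_word_tokenizer(UPSTREAM_CONFIG)
print(json.dumps(patched, indent=2))
```

Only the word-level tokenizer setting changes in this sketch; the character-level subword setting is left as it was, which matches the README's statement that the model is otherwise almost the same as the upstream one.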