6ad8aa3 fe523d6
1
2
3
4
5
--- license: cc-by-nc-4.0 --- Chinese tokenizer trained on Baike by using BPE algorithm