Why is vocab_size larget than tokens?

#6
by wjq521168 - opened

The README.md shows: vocab_size = 32000, but tokens is 65534, i am not sure there's an error here.

Sign up or log in to comment