Differences with mistral-7b-v0.2?
#7
by
mallorbc
- opened
From my understanding, the v0.2 model also had a 32k context window(without sliding window). Is the only difference here then the different tokenizers?
yes im also confused ?
was this given other unicode characters ?
such as chinese and japanese and sancrit and amarhiric ? ( the non standards ?) was it trained on multi lingugal ?? what are the actual changes ?
Hi there, the main changes are as mentionned the improved tokenizer and so the higher vocabulary!