Add a README note that LlamaTokenizerFast is not included in this build, so the Inference API will not work. For now, use the LlamaTokenizerFast from Doctor-Shotgun/TinyLlama-1.1B-32k-Instruct to run this model. 6f804a0 matlok committed on Feb 7
Update the README to show how this model was created, what it looks like in the header, and that merging 5 models takes ~80s. 50228f4 matlok committed on Feb 7
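
The tokenizer workaround described in commit 6f804a0 could look roughly like the sketch below. It assumes the `transformers` library is installed and that the Hub is reachable. The helper name `load_tokenizer` is hypothetical and is not taken from the README. Only the tokenizer is loaded here, because the commit messages do not state this model's own repository id.

```python
# Repo named in the commit message as the source of a working LlamaTokenizerFast.
TOKENIZER_REPO = "Doctor-Shotgun/TinyLlama-1.1B-32k-Instruct"


def load_tokenizer(repo_id: str = TOKENIZER_REPO):
    """Download and return the fast tokenizer from the given Hub repo.

    Hypothetical helper for illustration only.
    """
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoTokenizer

    # use_fast=True requests the Rust-backed "fast" tokenizer class.
    return AutoTokenizer.from_pretrained(repo_id, use_fast=True)


if __name__ == "__main__":
    tokenizer = load_tokenizer()
    # Show which tokenizer class was resolved and a sample encoding.
    print(type(tokenizer).__name__)
    print(tokenizer("Hello, world!")["input_ids"])
```

The returned tokenizer can then be paired with the merged model's weights when running it locally, since the Inference API is reported not to work with this build.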