Mistral-Inference package

#32

by swtb - opened Jun 5

swtb

Jun 5

Can someone summarise for me what are the key advantages of using mistral-inference? Are there performance issues with the transformers version? In an experiment I am running I find that Mistral instruct is slower than Llama and Phi and I am wondering if this is really because of mistral or if it is because of the transformers implementation.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment