Is there any possibility of getting a quantized version of this model shortly? This would help use this on consumer GPUs with limited RAM.
Thanks!
· Sign up or log in to comment