Does anyone have this quantized to run with llama.cpp? I've seen @cmp-nct quantize some of these, but I haven't found a well-documented way to do it for multimodal models.
Hi @JoshXT, we currently don't have plans to do that. Building a better ecosystem for multimodal models will take the whole community.