Update readme
- README.md +18 -0
- comparison.png +0 -0
- icon.png +0 -0
README.md
ADDED
@@ -0,0 +1,18 @@
+---
+inference: false
+---
+
+# Model Card
+
+<p align="center">
+<img src="./icon.png" alt="Logo" width="350">
+</p>
+
+📖 Technical report (coming soon) | 🏠 [Code](https://github.com/BAAI-DCAI/Bunny) | 🐰 [Demo](http://bunny.dataoptim.org/)
+
+Bunny is a family of lightweight but powerful multimodal models. It offers multiple plug-and-play vision encoders, such as EVA-CLIP and SigLIP, and language backbones, including Phi-1.5, StableLM-2 and Phi-2. To compensate for the smaller model size, we construct more informative training data through curated selection from a broader set of sources. Remarkably, our Bunny-3B model, built on SigLIP and Phi-2, outperforms not only state-of-the-art MLLMs of a similar size but also larger frameworks (7B), and even achieves performance on par with 13B models.
+
+The model is pretrained on LAION-2M and finetuned on Bunny-695K.
+More details about this model can be found on [GitHub](https://github.com/BAAI-DCAI/Bunny).
+
+![comparison](comparison.png)
comparison.png
ADDED
icon.png
ADDED
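
For readers who want to try the model described in this card, below is a minimal loading sketch using the Hugging Face `transformers` API. It is an illustration under stated assumptions, not part of this commit: the repo id `BAAI/bunny-phi-2-siglip` is hypothetical (check the model page for the published name), and the sketch assumes the checkpoint ships Bunny's custom modeling code, hence `trust_remote_code=True`. Image preprocessing is model-specific and omitted here; see the [GitHub](https://github.com/BAAI-DCAI/Bunny) repo for the full pipeline.

```python
# Minimal sketch: loading a Bunny-style checkpoint with transformers.
# The repo id below is an ASSUMPTION for illustration; check the actual
# model page for the published name.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "BAAI/bunny-phi-2-siglip"  # hypothetical repo id

# Bunny ships custom modeling code, so remote code must be trusted.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision fits a 3B model on one GPU
    device_map="auto",
    trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)

# Text-only round trip; multimodal image input needs Bunny's own
# preprocessing, which is omitted in this sketch.
prompt = "Describe the image."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```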