Spaces:

MMIE
/

README

Sleeping

Lillianwei commited on 23 days ago

Commit

e53fd38

•

1 Parent(s): beb48c6

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -40,7 +40,7 @@ We introduce **MMIE**, a robust, knowledge-intensive benchmark to evaluate inter
 2. **📈 Challenging the Best**: Even top models like **GPT-4o + SDXL** peak at 65.47%, highlighting room for growth in LVLMs.
 3. **🌐 Designed for Interleaved Tasks**: The benchmark supports evaluation across both text and image comprehension with both **multiple-choice and open-ended** formats.
----
 ### 🔧 Dataset Details
 <div align="center">
@@ -48,3 +48,4 @@ We introduce **MMIE**, a robust, knowledge-intensive benchmark to evaluate inter
 </div>
 MMIE is curated to evaluate models' comprehensive abilities in interleaved multimodal comprehension and generation. The dataset features diverse examples, categorized and distributed across different fields as illustrated above. This ensures balanced coverage across various domains of interleaved input/output tasks, supporting accurate and detailed model evaluations.

 2. **📈 Challenging the Best**: Even top models like **GPT-4o + SDXL** peak at 65.47%, highlighting room for growth in LVLMs.
 3. **🌐 Designed for Interleaved Tasks**: The benchmark supports evaluation across both text and image comprehension with both **multiple-choice and open-ended** formats.
+<!-- ---
 ### 🔧 Dataset Details
 <div align="center">
 </div>
 MMIE is curated to evaluate models' comprehensive abilities in interleaved multimodal comprehension and generation. The dataset features diverse examples, categorized and distributed across different fields as illustrated above. This ensures balanced coverage across various domains of interleaved input/output tasks, supporting accurate and detailed model evaluations.
+ -->