Update README.md
Browse files
README.md
CHANGED
@@ -25,9 +25,9 @@ Because GPT-4 has not been fine-tuned on these VQA tasks, the answers it generat
|
|
25 |
|
26 |
|
27 |
| Dataset | Metric | Med-Gemini | Med-PaLM-540B | GPT-4V | LLaVa3-Med|
|
28 |
-
|
29 |
-
| Slake-VQA | Token F1 | 87.5
|
30 |
-
| Path-VQA | Token F1 | 64.7
|
31 |
|
32 |
|
33 |
Table 1 | Multimodal evaluation. Performance comparison of LLaVa3-Med versus state-of-the-art (SoTA) methods.
|
|
|
25 |
|
26 |
|
27 |
| Dataset | Metric | Med-Gemini | Med-PaLM-540B | GPT-4V | LLaVa3-Med|
|
28 |
+
|-----------------------|----------|------------|---------------|--------|-----------|
|
29 |
+
| Slake-VQA | Token F1 | 87.5 | 89.3 | 76.8 | 89.8† |
|
30 |
+
| Path-VQA | Token F1 | 64.7 | 62.7 | 57.7 | 64.9† |
|
31 |
|
32 |
|
33 |
Table 1 | Multimodal evaluation. Performance comparison of LLaVa3-Med versus state-of-the-art (SoTA) methods.
|