File size: 2,027 Bytes
c278dba d35217c c278dba c6d6cb9 1d493f0 c6d6cb9 1b3cfb3 96f4634 c278dba d35217c 2f31089 d2e57d4 95bdfcb c278dba a1a4f9a d35217c f697ad1 d35217c c278dba f697ad1 2f31089 c278dba 95bdfcb 324f71c c278dba 2f31089 c278dba 324f71c c278dba 324f71c c278dba 324f71c |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 |
---
library_name: transformers
tags:
- Medical Visual Question Answering
- VQA
- IDEFIC
- 9B
- 4 Bit
- LORA
- Combining base with Adapter models
license: apache-2.0
---
# Contributed by:
- Shashwath P
- Shashank Ashok
- Akilan Yohendiran
# Total downloads all time - 2106
# Model Card for Model ID
<!-- Provide a quick summary of what the model is/does. -->
The following model is an experimental fine tuned model of the
IDEFIC 9B version, for medical Visual Question Answering.
It uses a dataset combined from SLAKE and VQARAD.
Check the following repository for the notebooks of training,merging and inference.
https://github.com/Shashwathp/Idefic_medical_vqa
### Model Description
<!-- Provide a longer summary of what this model is. -->
This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
- **Developed by:** [@Shashwath01,@Akill19,@Shashank91097 ]
- **Model type:** [Multimodal, Visual Question Answering]
- **Language(s) (NLP):** [English]
- **License:** [Apache - 2.0]
- **Finetuned from model [optional]:** [IDEFIC 9B]
### Dataset
https://huggingface.co/datasets/Shashwath01/VQARAD_SLAKE
### Model Sources
<!-- Provide the basic links for the model. -->
- **Repository:** https://github.com/Shashwathp/Idefic_medical_vqa
<!--- **Paper :** https://ieeexplore.ieee.org/document/10616779-->
## How to Get Started with the Model
Check the below link to get started with inferencing.
https://github.com/Shashwathp/Idefic_medical_vqa/blob/main/inference.ipynb
<!--## Citation
If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section.
[1] S. Punneshetty, S. Ashok, M. Niranjanamurthy, and S. V. N. Murthy, "Fine Tuning Idefic 9b With LORA for Multimodal Medical VQA," in *Proceedings of the 2024 International Conference on Knowledge Engineering and Communication Systems (ICKECS)*, India, Apr. 2024, pp. 1-8. DOI: 10.1109/ICKECS61492.2024.10616779.-->
|