|
--- |
|
library_name: transformers |
|
tags: |
|
- Medical Visual Question Answering |
|
- VQA |
|
- IDEFIC |
|
- 9B |
|
- 4 Bit |
|
- LORA |
|
- Combining base with Adapter models |
|
license: apache-2.0 |
|
--- |
|
|
|
# Contributed by: |
|
- Shashwath P |
|
- Shashank Ashok |
|
- Akilan Yohendiran |
|
|
|
|
|
# Total downloads all time - 1126 |
|
|
|
# Model Card for Model ID |
|
|
|
<!-- Provide a quick summary of what the model is/does. --> |
|
The following model is an experimental fine tuned model of the |
|
IDEFIC 9B version, for medical Visual Question Answering. |
|
It uses a dataset combined from SLAKE and VQARAD. |
|
Check the following repository for the notebooks of training,merging and inference. |
|
https://github.com/Shashwathp/Idefic_medical_vqa |
|
|
|
|
|
### Model Description |
|
|
|
<!-- Provide a longer summary of what this model is. --> |
|
|
|
This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated. |
|
|
|
- **Developed by:** [@Shashwath01,@Akill19,@Shashank91097 ] |
|
- **Model type:** [Multimodal, Visual Question Answering] |
|
- **Language(s) (NLP):** [English] |
|
- **License:** [Apache - 2.0] |
|
- **Finetuned from model [optional]:** [IDEFIC 9B] |
|
|
|
### Dataset |
|
|
|
https://huggingface.co/datasets/Shashwath01/VQARAD_SLAKE |
|
|
|
### Model Sources |
|
|
|
<!-- Provide the basic links for the model. --> |
|
|
|
- **Repository:** https://github.com/Shashwathp/Idefic_medical_vqa |
|
- **Paper :** |
|
- **Demo :** |
|
|
|
|
|
## How to Get Started with the Model |
|
|
|
Check the below link to get started with inferencing. |
|
https://github.com/Shashwathp/Idefic_medical_vqa/blob/main/inference.ipynb |
|
|
|
|
|
## Citation [optional] |
|
|
|
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. --> |
|
|
|
**BibTeX:** |
|
|
|
[More Information Needed] |