File size: 1,873 Bytes
ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b ad4769f 702023b |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 |
---
library_name: transformers
datasets:
- agentsea/wave-ui-25k
language:
- en
---
# Paligemma WaveUI
Transformers [PaliGemma 3B 448-res weights](https://huggingface.co/google/paligemma-3b-pt-448), fine-tuned on the [WaveUI](https://huggingface.co/datasets/agentsea/wave-ui) dataset for object-detection.
## Model Details
### Model Description
This fine-tune was done atop of the [Paligemma 448 Widgetcap](https://huggingface.co/google/paligemma-3b-ft-widgetcap-448) model, using the [WaveUI](https://huggingface.co/datasets/agentsea/wave-ui) dataset, which contains ~80k examples of labeled UI elements.
The fine-tune was done for the object detection task. Specifically, this model aims to perform well at UI element detection, as part of a wider effort to enable our open-source toolkit for building agents at [AgentSea](https://www.agentsea.ai/).
- **Developed by:** https://agentsea.ai/
- **Language(s) (NLP):** en
- **Finetuned from model:** https://huggingface.co/google/paligemma-3b-ft-widgetcap-448
### Demo
You can find a **demo** for this model [here](https://huggingface.co/spaces/agentsea/paligemma-waveui).
## Notes
- The only task used in the fine-tune was the object detection task, so it might not perform well in other types of tasks.
## Usage
To start using this model, run the following:
```python
from transformers import AutoProcessor, PaliGemmaForConditionalGeneration
model = PaliGemmaForConditionalGeneration.from_pretrained("agentsea/paligemma-3b-ft-widgetcap-waveui-448").eval()
processor = AutoProcessor.from_pretrained("google/paligemma-3b-pt-448")
```
## Data
We used the [WaveUI](https://huggingface.co/datasets/agentsea/wave-ui) dataset for this fine-tune. Before using it, we preprocessed the data to use the Paligemma bounding-box format.
## Evaluation
We will release a full evaluation report soon. Stay tuned! :) |