# English to Urhobo

Author: Iroro Orife

## Data

- The JW300 English-Urhobo dataset.

## Model

- Default Masakhane Transformer translation model.
- [Google Drive folder with models](https://drive.google.com/open?id=1-0REUw5fg_Y13wrKgE9RFD_iljOsXykr)

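## Analysis

The dataset needs further preprocessing to remove special characters and Scripture references (book names and chapter/verse numbers). This should make the model more generally useful outside religious-text translation. The two examples below show the problem: in Example 1 the hypothesis opens with a spurious Scripture citation, and in Example 2 it appends a word that is absent from the reference.
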
Example 1

```text
Source: But freedom from what ?
Reference: Ẹkẹvuọvo , ẹdia vọ yen egbomọphẹ na che si ayen nu ?
Hypothesis: ( 1 Pita 3 : 1 ) Ẹkẹvuọvo , die yen egbomọphẹ
```

Example 2

```text
Source: Today he is serving at Bethel .
Reference: Nonẹna , ọ ga vwẹ Bẹtẹl .
Hypothesis: Nonẹna , ọ ga vwẹ Bẹtẹl asaọkiephana .
```
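
One possible way to approach the Scripture-reference cleanup described above is a regular-expression filter applied to both sides of the parallel corpus before training. The sketch below is only an illustration: `SCRIPTURE_REF` and `strip_scripture_refs` are hypothetical names, not part of the Masakhane pipeline, and the pattern assumes citations appear as tokenized, parenthesised spans containing a `chapter : verse` pair, as in the hypothesis of Example 1.

```python
import re

# Assumption: a citation is a parenthesised span that contains at least one
# "chapter : verse" pair, e.g. "( 1 Pita 3 : 1 )". Any other parenthesised
# span with a "digits : digits" pair (such as a clock time) is removed too.
SCRIPTURE_REF = re.compile(r"\(\s*[^()]*?\d+\s*:\s*\d+[^()]*?\)")


def strip_scripture_refs(line: str) -> str:
    """Remove parenthesised chapter:verse citations and tidy whitespace."""
    cleaned = SCRIPTURE_REF.sub(" ", line)
    # Collapse the whitespace left behind by the removed span.
    return re.sub(r"\s+", " ", cleaned).strip()


if __name__ == "__main__":
    sample = "( 1 Pita 3 : 1 ) Ẹkẹvuọvo , die yen egbomọphẹ"
    print(strip_scripture_refs(sample))  # Ẹkẹvuọvo , die yen egbomọphẹ
```

Citations that are not parenthesised, and the other special characters mentioned in the analysis, would need their own rules once the corpus has been inspected.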

## Results

Tokenization | BLEU dev | BLEU test
--- | --- | ---
BPE | 15.91 | 28.82
Word-level | 11.80 | 22.39