edaiofficial's picture
initial commits
78aa4ee
|
raw
history blame
976 Bytes
# English to Urhobo
Author: iroro orife
## Data
- The JW300 English-Urhobo dataset.
## Model
- Default Masakhane Transformer translation model.
- [Link to google drive folder with models](https://drive.google.com/open?id=1-0REUw5fg_Y13wrKgE9RFD_iljOsXykr)
## Analysis
The dataset requires more preprocessing to remove special characters and Scripture chapters/verse names & figures. This will make the model more generally useful outside of religious text translations.
Example 1
```sh
Source: But freedom from what ?
Reference: Ẹkẹvuọvo , ẹdia vọ yen egbomọphẹ na che si ayen nu ?
Hypothesis: ( 1 Pita 3 : 1 ) Ẹkẹvuọvo , die yen egbomọphẹ
```
Example 2
```sh
Source: Today he is serving at Bethel .
Reference: Nonẹna , ọ ga vwẹ Bẹtẹl .
Hypothesis: Nonẹna , ọ ga vwẹ Bẹtẹl asaọkiephana .
```
# Results
Tokenization | BLEU dev | BLEU test
--- | --- | ---
BPE| 15.91 | 28.82
Word-level | 11.80 | 22.39