# English to Ẹ̀dó

Author: Iroro Orife

## Data

- The JW300 English-Ẹ̀dó dataset (language code `bin`).

## Model

- Default Masakhane Transformer translation model.
- [Link to Google Drive folder with models](https://drive.google.com/drive/folders/18P6HH9wavVpaR3UufoiUsTeqMnkvc1He)

## Analysis

The dataset needs further preprocessing to remove special characters and Scripture chapter and verse names and numbers. It is also very small, which is the primary factor limiting how much the model can learn.
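As a rough illustration of that kind of cleanup, here is a minimal Python sketch. It is not the preprocessing used for the results in this README. The `SCRIPTURE_REF` pattern, the `KEEP_PUNCT` set, and the `clean_line` helper are all hypothetical names, and the pattern assumes references shaped like `Prov. 22:4` or `1 Pet. 5:5-7`. It would need adjusting to the book-name abbreviations that actually appear in the corpus.

```python
import re
import unicodedata

# Assumed shape of a Scripture reference: optional book number, a book name
# (letters, possibly with combining tone marks), chapter:verse, and an
# optional verse range or list. This is a guess, not the corpus's real format.
SCRIPTURE_REF = re.compile(
    r"(?:\b[1-3]\s)?(?:[^\W\d_]|[\u0300-\u036f])+\.?\s"
    r"\d{1,3}\s?:\s?\d{1,3}\b(?:\s?[-,]\s?\d{1,3}\b)*"
)

# Punctuation kept in the output; everything else that is not a letter,
# combining mark, digit, or whitespace is treated as a special character.
KEEP_PUNCT = set(".,?!;:'\"-")


def clean_line(text: str) -> str:
    """Drop Scripture references and special characters from one line."""
    text = unicodedata.normalize("NFC", text)
    text = SCRIPTURE_REF.sub(" ", text)
    kept = []
    for ch in text:
        category = unicodedata.category(ch)
        # Categories starting with L, M, N are letters, marks, and numbers;
        # keeping marks preserves Ẹ̀dó tone and dot-below diacritics.
        if category[0] in "LMN" or ch in KEEP_PUNCT or ch.isspace():
            kept.append(ch)
        else:
            kept.append(" ")
    # Collapse the runs of spaces left behind by the removals.
    return " ".join("".join(kept).split())


if __name__ == "__main__":
    print(clean_line("What are the rewards for being humble ? ( Prov. 22:4 )"))
```

The character filter works on Unicode categories rather than an ASCII whitelist, so the diacritics in Ẹ̀dó text survive. A pattern this loose can also remove non-Scripture text that looks like `word 3:15`, such as a time of day, so it is worth spot-checking its matches on a sample before applying it to the full corpus.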

Example 1
```text
	Source:     What are the rewards for being humble ?
	Reference:  Ma ghaa mu egbe rriotọ , de afiangbe na lae miẹn ?
	Hypothesis: De emwi nọ khẹke ne ọmwa nọ dizigha ọyẹvbu ru ?
```

Example 2
```text
	Source:     One reason is that he rules with love . Indeed , our hearts are touched by how he chooses to exercise his sovereignty .
	Reference:  Ọna ẹre ọ si ẹre ne Arriọba ọghẹe na maan sẹ ọghe emwa ọvbehe , ere ọ vbe si ẹre ne emwa ne irẹn kha yan na mwẹ agbẹkunsotọ .
	Hypothesis: Ọ keghi re odẹ ọkpa ne ima ya rhiẹre ma wẹẹ ima hoẹmwẹ ọnrẹn kevbe wẹẹ , irẹn ẹre ọ gele mwẹ asẹ ne a ya khaevbisẹ .
```

## Results

Tokenization | BLEU dev | BLEU test
--- | --- | ---
BPE | 7.92 | 12.49
Word-level | 5.99 | 8.24