# English to Ẹ̀sán Author: iroro orife ## Data - The JW300 English-Ẹ̀sán (ish) dataset. ## Model - Default Masakhane Transformer translation model. - [Link to google drive folder with models](https://drive.google.com/drive/folders/1FhIlYmQVlVBCUL5t_BJv6fOcVdAGHqZg) ## Analysis The dataset requires more preprocessing to remove special characters and Scripture chapters/verse names & figures. Also it is very small, which is the primary limiting factor on being able to learn anything useful. Example 1 ```sh Source: In contrast with the people who show the widespread lack of love today , those who worship Jehovah have genuine love for their fellow man . Reference: Ene ga iJehova ẹlẹnan wo mhọn oyẹẹ da ibo ele , ele bha diabe ene iga Jehova ne bha hoẹmhọn ibo ele . Hypothesis: Ene ẹbho ne bunbun wo manman ha mhọn urẹọbhọ da ẹbho n ``` Example 2 ```sh Source: We should also strive to help others spiritually . Reference: Ahamiẹn mhan re ẹghe bhi otọ rẹ ha luẹ iBaibo , yẹ deba ene gene guanọ nin ele ga iJehova ha muobọ , ọ dẹ rẹkpa mhan rẹ ziẹn ikolu nin mhan bi Jehova koko mhọnlẹn . Hypothesis: Mhan dẹ sabọ rẹkpa mhan rẹ sabọ ha mhọn urẹọbhọ bọsi eria . ``` # Results Tokenization | BLEU dev | BLEU test --- | --- | --- BPE| 4.94 | 6.25 Word-level | 3.39 | 5.30