English to Ẹ̀sán

Author: iroro orife

Data

- The JW300 English-Ẹ̀sán dataset (Ẹ̀sán language code: ish).

Model

Analysis

The dataset needs further preprocessing to remove special characters and Scripture references (book names with chapter and verse numbers). It is also very small, which is the primary factor limiting what the model can learn.
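The cleanup described above could look roughly like the following. This is a minimal sketch, not the preprocessing used for this model: the `SCRIPTURE_REF` pattern, the `KEEP_PUNCT` set, and the function names are all hypothetical, and the pattern assumes references look like "John 3:16" or "1 Cor. 13:4-7". It will also match other `number:number` tokens such as clock times, so it would need tuning against the real corpus.

```python
import re
import unicodedata

# Assumed shape of a Scripture reference: optional book number, one token
# for the book name, then chapter:verse with optional ranges or lists.
SCRIPTURE_REF = re.compile(
    r"(?:\b[1-3]\s)?"        # optional book number, e.g. the "1" in "1 Cor."
    r"\S+\s"                 # book name token, e.g. "John" or "Cor."
    r"\d+:\d+"               # chapter:verse
    r"(?:\s?[-,]\s?\d+)*"    # verse ranges or lists, e.g. "-7" or ", 5"
)

# Punctuation kept in the text (an assumption; adjust as needed).
KEEP_PUNCT = set(".,;:!?'\"-()")


def strip_special_chars(text):
    """Keep letters, combining marks, digits, whitespace and basic punctuation."""
    kept = []
    for ch in text:
        category = unicodedata.category(ch)
        # Marks (M*) must be kept: Ẹ̀sán diacritics can be combining characters.
        if category[0] in "LMN" or ch.isspace() or ch in KEEP_PUNCT:
            kept.append(ch)
    return "".join(kept)


def clean_line(text):
    """Normalize one sentence and remove references and special characters."""
    text = unicodedata.normalize("NFC", text)
    text = SCRIPTURE_REF.sub(" ", text)
    text = strip_special_chars(text)
    return " ".join(text.split())  # collapse runs of whitespace


def clean_pairs(pairs):
    """Clean (source, target) pairs and drop pairs with an empty side."""
    cleaned = ((clean_line(src), clean_line(tgt)) for src, tgt in pairs)
    return [(src, tgt) for src, tgt in cleaned if src and tgt]
```

Dropping pairs where either side becomes empty keeps the two sides of the parallel corpus aligned after cleaning, at the cost of shrinking an already small dataset slightly.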

Example 1

    Source:     In contrast with the people who show the widespread lack of love today , those who worship Jehovah have genuine love for their fellow man .
    Reference:  Ene ga iJehova ẹlẹnan wo mhọn oyẹẹ da ibo ele , ele bha diabe ene iga Jehova ne bha hoẹmhọn ibo ele .
    Hypothesis: Ene ẹbho ne bunbun wo manman ha mhọn urẹọbhọ da ẹbho n

Example 2

    Source:     We should also strive to help others spiritually .
    Reference:  Ahamiẹn mhan re ẹghe bhi otọ rẹ ha luẹ iBaibo , yẹ deba ene gene guanọ nin ele ga iJehova ha muobọ , ọ dẹ rẹkpa mhan rẹ ziẹn ikolu nin mhan bi Jehova koko mhọnlẹn .
    Hypothesis: Mhan dẹ sabọ rẹkpa mhan rẹ sabọ ha mhọn urẹọbhọ bọsi eria .

Results

    Tokenization    BLEU (dev)    BLEU (test)
    BPE             4.94          6.25
    Word-level      3.39          5.30