edaiofficial's picture
initial commits
78aa4ee
# English to Kamba
Author: Kathleen Siminyu
## Data
- The JW300 English-Kamba dataset, 58312 lines.
## Model
- Link to google drive folder with model(https://drive.google.com/open?id=1Y7Judrp0bakZh6mN5ScMOJ3pLr3knD2W)
## Analysis
- Tried out different BPE settings and managed some improvements on the baseline. Highest BLEU score I recorded was at BPE 25000. It might be worth exploring different bpe settings for the source and target languages. Results from different settings included in the analysis below.
BPE 4000
```sh
BLEU dev: 14.83
BLEU test: 24.96
```
BPE 5000
```sh
2020-05-10 17:08:17,436 - dev bleu: 15.00 [Beam search decoding with beam size = 5 and alpha = 1.0]
2020-05-10 17:08:53,304 - test bleu: 26.06 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 10000
```sh
2020-05-10 17:09:16,286 - dev bleu: 16.63 [Beam search decoding with beam size = 5 and alpha = 1.0]
2020-05-10 17:09:51,410 - test bleu: 27.20 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 15000
```sh
2020-05-10 17:10:16,787 - dev bleu: 16.72 [Beam search decoding with beam size = 5 and alpha = 1.0]
2020-05-10 17:10:53,812 - test bleu: 27.34 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 20000
```sh
2020-05-10 17:11:20,392 - dev bleu: 16.00 [Beam search decoding with beam size = 5 and alpha = 1.0]
2020-05-10 17:11:57,920 - test bleu: 27.04 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 30000
```sh
2020-05-10 17:13:45,951 - dev bleu: 14.93 [Beam search decoding with beam size = 5 and alpha = 1.0]
2020-05-10 17:14:28,275 - test bleu: 25.84 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 35000
```sh
2020-05-10 17:15:06,838 - dev bleu: 14.83 [Beam search decoding with beam size = 5 and alpha = 1.0]
2020-05-10 17:15:51,339 - test bleu: 25.19 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 40000
```sh
2020-05-10 17:16:32,342 - dev bleu: 15.03 [Beam search decoding with beam size = 5 and alpha = 1.0]
2020-05-10 17:17:19,359 - test bleu: 25.87 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
# Results
BPE 25000
```
2020-05-10 17:12:29,184 - dev bleu: 16.70 [Beam search decoding with beam size = 5 and alpha = 1.0]
2020-05-10 17:13:12,772 - test bleu: 27.90 [Beam search decoding with beam size = 5 and alpha = 1.0]
```