File size: 2,415 Bytes
78aa4ee
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
# English to Luo

Author: Kathleen Siminyu

## Data

	- The JW300 English-Luo dataset, 154941 lines.

## Model
  
	- Link to google drive folder with model(https://drive.google.com/drive/folders/1_q4Gj4cIAbhNxTU3XOa1OH9Sxtg2cvEM?usp=sharing)

## Analysis
	- Tried out different BPE settings and managed some improvements on the baseline. Highest BLEU score I recorded was at BPE 30000. It might be worth exploring different bpe settings for the source 		and target languages. Results from different settings included in the analysis below.

BPE 4000
```sh
	BLEU dev: 23.22
	BLEU test: 32.64
```

BPE 5000
```sh
	2020-05-17 17:50:01,865 -  dev bleu:  23.32 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-17 17:51:18,137 - test bleu:  32.52 [Beam search decoding with beam size = 5 and alpha = 1.0]
```

BPE 10000
```sh
	2020-05-17 17:52:12,396 -  dev bleu:  24.26 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-17 17:53:26,893 - test bleu:  32.97 [Beam search decoding with beam size = 5 and alpha = 1.0]
```

BPE 15000
```sh
	2020-05-17 17:54:28,069 -  dev bleu:  24.55 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-17 17:55:46,813 - test bleu:  33.43 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 20000
```sh
	2020-05-17 18:01:41,447 -  dev bleu:  25.34 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-17 18:03:03,470 - test bleu:  33.38 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 25000
```sh
	2020-05-17 18:09:26,069 -  dev bleu:  24.81 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-17 18:10:45,675 - test bleu:  33.68 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 35000
```sh
	2020-05-17 20:35:45,530 -  dev bleu:  25.08 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-17 20:36:23,282 - test bleu:  33.70 [Beam search decoding with beam size = 5 and alpha = 1.0]
```
BPE 40000
```sh
	2020-05-17 20:43:32,542 -  dev bleu:  24.75 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-17 20:44:11,245 - test bleu:  33.58 [Beam search decoding with beam size = 5 and alpha = 1.0]
```

# Results
BPE 30000
```
	2020-05-17 20:28:46,395 -  dev bleu:  25.24 [Beam search decoding with beam size = 5 and alpha = 1.0]
	2020-05-17 20:29:20,937 - test bleu:  34.33 [Beam search decoding with beam size = 5 and alpha = 1.0]
```