File size: 2,956 Bytes
92c94de
 
e0e062d
 
 
 
cc7cd4e
 
 
 
 
e0e062d
 
 
cc7cd4e
 
e0e062d
 
 
 
 
 
 
 
 
 
 
 
 
 
cc7cd4e
e0e062d
 
cc7cd4e
 
e0e062d
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
---
license: apache-2.0
---

The results:

|Vocab size | Greedy search(dev & test) | Modified beam search(dev & test) | Fast beam search (dev & test)  | Fast beam search LG (dev & test) | comments|
|-- | -- | -- | -- | -- | --|
|500 | 4.31 & 4.59 | 4.25 & 4.54 | 4.27 & 4.55 |  4.07 & 4.38 | --epoch 48 --avg 29|



The training command:


```bash
export CUDA_VISIBLE_DEVICES="4,5,6,7"

./pruned_transducer_stateless7_bbpe/train.py \
  --world-size 4 \
  --num-epochs 50 \
  --start-epoch 1 \
  --use-fp16 1 \
  --max-duration 800 \
  --bpe-model data/lang_bbpe_500/bbpe.model \
  --exp-dir pruned_transducer_stateless7_bbpe/exp \
  --lr-epochs 6 \
  --master-port 12535
```


The decoding command:


```bash
for m in greedy_search modified_beam_search fast_beam_search fast_beam_search_LG; do 
    ./pruned_transducer_stateless7_bbpe/decode.py \                                                                                                                                                                                              
      --epoch 48 \                                                                                                                                                                                                                         
      --avg 29 \                                                                                                                                                                                                                           
      --exp-dir ./pruned_transducer_stateless7_bbpe/exp \                                                                                                                                                                                  
      --max-sym-per-frame 1 \                                                                                                                                                                                                              
      --ngram-lm-scale 0.25 \                                                                                                                                                                                                            
      --ilme-scale 0.2 \                                                                                                                                                                                                                 
      --bpe-model data/lang_bbpe_500/bbpe.model \                                                                                                                                                                                         
      --max-duration 2000 \                                                                                                                                                                                                                
      --decoding-method $m
done
```