""" from https://github.com/keithito/tacotron """

# Defines the set of symbols used in text input to the model.

_pad = '_'
_punctuation = ';:,.!?¡¿—-<>*()…"«»“”~ '
_letters = 'ABCÇDEFGHIJKLMNOPQRSTUVWXYZÂÊÎÔÛâêîôûéÉèåÅÈàÀùÙÌìëöabcçdefghijklmnopqrstuvwxyz'
_letters_ipa = "õ̃ɑɐɒæɓʙβɔɕçɗɖðʤəɘɚɛɜɝɞɟʄɡɠɢʛɦɧħɥʜɨɪʝɭɬɫɮʟɱɯɰŋɳɲɴøɵɸθœɶʘɹɺɾɻʀʁɽʂʃʈʧʉʊʋⱱʌɣɤʍχʎʏʑʐʒʔʡʕʢǀǁǂǃˈˌːˑʼʴʰʱʲʷˠˤ˞↓↑→↗↘'̩'ᵻ"


# Export all symbols; a symbol's ID is its index in this list:
symbols_pho = [_pad] + list(_punctuation) + list(_letters) + list(_letters_ipa)

# Special symbol ids
SPACE_ID = symbols_pho.index(" ")
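

# A minimal sketch (not part of the original keithito/tacotron code) of how the
# symbol list above might be turned into ID sequences. The helper names
# `build_symbol_to_id` and `text_to_ids` are hypothetical and introduced here
# for illustration only; skipping unknown characters is likewise an assumption.
def build_symbol_to_id(symbols):
    """Map each symbol to the index of its first occurrence in `symbols`.

    Keeping the first occurrence matches `list.index`, which is how SPACE_ID
    is computed above, even if the list contains repeated symbols.
    """
    symbol_to_id = {}
    for index, symbol in enumerate(symbols):
        symbol_to_id.setdefault(symbol, index)
    return symbol_to_id


def text_to_ids(text, symbol_to_id):
    """Convert `text` to a list of IDs, skipping characters not in the map."""
    return [symbol_to_id[ch] for ch in text if ch in symbol_to_id]


# Example usage, assuming the definitions above:
#   _symbol_to_id = build_symbol_to_id(symbols_pho)
#   ids = text_to_ids("some phonemized text", _symbol_to_id)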