File size: 749 Bytes
f6345bc
5d7c1b7
09b4276
5d7c1b7
0094748
09b4276
 
15dfe53
f6345bc
5d7c1b7
09b4276
0a89e96
 
 
b87986b
09b4276
 
 
 
15fe489
09b4276
2b4230d
 
 
 
857f0c3
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
---
language:
  - uk
tags:
  - text2text-generation
  - punctuation prediction
  - punctuation
library_name: generic
license: mit
metrics:
  - f1
datasets:
- ubertext2.0
widget:
- text: "доброго вечора ми з україни"
---

# Ukrainian model to restore punctuation and capitalization

This is the NeMo model to restore punctuation and capitalization in sentences, trained on 10m+ sentences from [UberText 2.0 corpus](https://lang.org.ua/en/ubertext/). Basic transformer under the hood is `bert-base-multilingual-cased`.

Model restores the following punctuations -- [? . ,].

It also restores capitalization of words.

Copyright: [Dmytro Chaplynskyi](https://twitter.com/dchaplinsky), [lang-uk](https://lang.org.ua) project, 2022