File size: 749 Bytes
f6345bc 5d7c1b7 09b4276 5d7c1b7 0094748 09b4276 15dfe53 f6345bc 5d7c1b7 09b4276 0a89e96 b87986b 09b4276 15fe489 09b4276 2b4230d 857f0c3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 |
---
language:
- uk
tags:
- text2text-generation
- punctuation prediction
- punctuation
library_name: generic
license: mit
metrics:
- f1
datasets:
- ubertext2.0
widget:
- text: "доброго вечора ми з україни"
---
# Ukrainian model to restore punctuation and capitalization
This is the NeMo model to restore punctuation and capitalization in sentences, trained on 10m+ sentences from [UberText 2.0 corpus](https://lang.org.ua/en/ubertext/). Basic transformer under the hood is `bert-base-multilingual-cased`.
Model restores the following punctuations -- [? . ,].
It also restores capitalization of words.
Copyright: [Dmytro Chaplynskyi](https://twitter.com/dchaplinsky), [lang-uk](https://lang.org.ua) project, 2022 |