burakaytan
commited on
Commit
•
c6f3c65
1
Parent(s):
af54a5e
Update README.md
Browse files
README.md
CHANGED
@@ -5,6 +5,8 @@ license: mit
|
|
5 |
🇹🇷 RoBERTaTurk-Small-Clean
|
6 |
|
7 |
## Model description
|
|
|
|
|
8 |
This is a Turkish small clean RoBERTa model, trained to understand Turkish language better.
|
9 |
We used special, clean data from Turkish Wikipedia, Turkish OSCAR, and news websites.
|
10 |
First, we had 38 GB of data, but we took out all the sentences with mistakes in them.
|
|
|
5 |
🇹🇷 RoBERTaTurk-Small-Clean
|
6 |
|
7 |
## Model description
|
8 |
+
It was trained with a clean dataset free of typos.
|
9 |
+
|
10 |
This is a Turkish small clean RoBERTa model, trained to understand Turkish language better.
|
11 |
We used special, clean data from Turkish Wikipedia, Turkish OSCAR, and news websites.
|
12 |
First, we had 38 GB of data, but we took out all the sentences with mistakes in them.
|