Julien Abadji
uj
AI & ML interests
oscar :)
Organizations
uj's activity
fix typo
1
#2 opened 4 months ago
by
uj
About the number of documents
6
#6 opened over 1 year ago
by
lixin4ever
Add info about virus warnings in README.md
#13 opened over 1 year ago
by
uj
Unsafe Files
20
#12 opened over 1 year ago
by
GetzPro
Deduplicated English Corpus
2
#3 opened over 1 year ago
by
conceptofmind
The data size of Chinses is only 385GB
2
#4 opened over 1 year ago
by
zxs1997zju
Data hosting on Huggingface
1
#2 opened over 1 year ago
by
hieuhocnlp
How to download only one language?
2
#1 opened over 1 year ago
by
musabg
how to use it
1
#2 opened over 2 years ago
by
graybyte
Fix typo in dataset card
#9 opened almost 2 years ago
by
albertvillanova
Issue : Dataset "doesn't exist on the Hub"
2
#1 opened about 2 years ago
by
RomanCast
mwparserfromhell: KeyError: "000nbsp [while running 'train/Clean content']" while cleaning Arabic data from 20/09/2022
1
#4 opened about 2 years ago
by
uj
Progression feedback on Beam related processing?
4
#1 opened over 2 years ago
by
uj
Using the Corpus
3
#1 opened over 2 years ago
by
vitvit