Spaces: CONDA-Workshop/Data-Contamination-Database
888fb82
14 contributors
History: 32 commits
OSainz
emilys
Superglue/RealNews Contamination based on "Noise-Robust De-Duplication at Scale" (#15)
888fb82
verified
7 months ago
File                      Scan  Size      Updated       Last commit
.gitattributes            Safe  1.52 kB   8 months ago  initial commit
.gitignore                Safe  19 Bytes  7 months ago  Fix arxiv links
README.md                 Safe  354 Bytes 7 months ago  Update README.md
app.py                    Safe  6.25 kB   7 months ago  Add ignorecase to search options
contamination_report.csv  Safe  49.7 kB   7 months ago  Superglue/RealNews Contamination based on "Noise-Robust De-Duplication at Scale" (#15)
dataset.py                Safe  9.64 kB   8 months ago  Add PR links to previous commits
markdown.py               Safe  9.83 kB   7 months ago  update urls
requirements.txt          Safe  73 Bytes  8 months ago  Initital commit
utils.py                  Safe  6.11 kB   8 months ago  Get token from environment