Spaces: CoreyMorris/MMLU-by-task-Leaderboard
4 contributors
History: 94 commits
Corey Morris: WIP commit. Currently have nlargest error (d506f10, about 1 year ago)
File | Size | Last commit message | Updated
.gitattributes | 1.52 kB | initial commit | about 1 year ago
.gitignore | 44 Bytes | Added .gitignore file | about 1 year ago
.gitmodules | 106 Bytes | added hugging face evaluation harness results submodule | about 1 year ago
README.md | 248 Bytes | initial commit | about 1 year ago
app.py | 16.8 kB | WIP commit. Currently have nlargest error | about 1 year ago
requirements.txt | 199 Bytes | updated requirements.txt | about 1 year ago
result_data_processor.py | 5.54 kB | WIP commit. Currently have nlargest error | about 1 year ago
test_integration.py | 1.83 kB | Added test to test the specific method that is currently producing an error | about 1 year ago
test_result_data_processing.py | 1.66 kB | Added organization to dataframe | about 1 year ago