Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Spaces:
CoreyMorris
/
MMLU-by-task-Leaderboard
like
13
Running
App
Files
Files
Community
4
9695a47
MMLU-by-task-Leaderboard
4 contributors
History:
62 commits
Corey Morris
Added radar chart. Compares a model to the 5 models that have the closest performance on MMLU_average
9695a47
about 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
over 1 year ago
.gitmodules
Safe
106 Bytes
added hugging face evaluation harness results submodule
over 1 year ago
README.md
Safe
248 Bytes
initial commit
over 1 year ago
app.py
12.4 kB
Added radar chart. Compares a model to the 5 models that have the closest performance on MMLU_average
about 1 year ago
requirements.txt
Safe
199 Bytes
updated requirements.txt
over 1 year ago
result_data_processor.py
Safe
3.69 kB
Moved rank data into a separate method and dataframe
over 1 year ago