Commit History

Fix load_results for None data frames
96f60e1
verified

albertvillanova HF staff committed on

Set result_paths_per_model as State
729af67
verified

albertvillanova HF staff committed on

Improve async loading performance of Details
4a05739
verified

albertvillanova HF staff committed on

Support more than 2 models
bea7063
verified

albertvillanova HF staff committed on

Hide MATH fewshot_config.samples with memory address
22fb9eb
verified

albertvillanova HF staff committed on

Add Download button
b1b50fb
verified

albertvillanova HF staff committed on

Avoid bug with radar plot
9ade1c2
verified

albertvillanova HF staff committed on

Plot also radar
a56da8a
verified

albertvillanova HF staff committed on

Plot with plotly
2b1d96b
verified

albertvillanova HF staff committed on

Extract concat_results function
ea4c670
verified

albertvillanova HF staff committed on

Rename dataframes
ccc567f
verified

albertvillanova HF staff committed on

Sort 3/5/7 subtasks
dcff8f7
verified

albertvillanova HF staff committed on

Fix missing Snark subtask
69bec6e
verified

albertvillanova HF staff committed on

Refactor glob to use the cache of HfFileSystem
7e32ac7
verified

albertvillanova HF staff committed on

Rename load file functions
fae0e19
verified

albertvillanova HF staff committed on

Fix overflow on small screens
d46be0d
verified

albertvillanova HF staff committed on

Update README.md
10753ac
verified

albertvillanova HF staff committed on

Align Details samples sorting by doc_id
6411b1c
verified

albertvillanova HF staff committed on

Update README.md
1d3349d
verified

clefourrier HF staff committed on

Support loading LFS Details files
39ff146
verified

albertvillanova HF staff committed on

Support .json and .jsonl details files
148216f
verified

albertvillanova HF staff committed on

Add explanation that login is required for GPQA Details
e970061
verified

albertvillanova HF staff committed on

Change colormap to PiYG
41fbe9f
verified

albertvillanova HF staff committed on

Rename to Hide Standard Errors
585c3fa
verified

albertvillanova HF staff committed on

Use color map for Results metrics values
581682a
verified

albertvillanova HF staff committed on

Add checkbox in Details to show only differences
6cf57e4
verified

albertvillanova HF staff committed on

Add checkbox in Configs to show only differences
f12aa56
verified

albertvillanova HF staff committed on

Add checkbox in Results to hide stderr
54e105e
verified

albertvillanova HF staff committed on

Implement login for GPQA Details
26ef426
verified

albertvillanova HF staff committed on

Fix loading Details with documents containing end of lines
662ed4b
verified

albertvillanova HF staff committed on

Fix style
611a3ed
verified

albertvillanova HF staff committed on

Fix wrapping to keep non-str data
e3edf6d
verified

albertvillanova HF staff committed on

Make beta-version warning less formal
19a6010
verified

albertvillanova HF staff committed on

Escape HTML tags in data
bd64e7a
verified

albertvillanova HF staff committed on

Fix URL to Leaderboard
7647125
verified

albertvillanova HF staff committed on

Display loading message
8f7c83f
verified

albertvillanova HF staff committed on

Add additional info to task description
651545d
verified

albertvillanova HF staff committed on

Import constants as submodule
30a0c61
verified

albertvillanova HF staff committed on

Improve label of subtasks
b6f3b94
verified

albertvillanova HF staff committed on

Add description of Tasks
ca2b34f
verified

albertvillanova HF staff committed on

Hide Details for GPQA task
5009abb
verified

albertvillanova HF staff committed on

Fix Details subtask info
daff9c0
verified

albertvillanova HF staff committed on

Add warning as beta version
c1fc7f4
verified

albertvillanova HF staff committed on

Use magenta instead of red color
33d0dfb
verified

albertvillanova HF staff committed on

Load Details asynchronously
2f4d877
verified

albertvillanova HF staff committed on

Load results asynchronously
d0f55c6
verified

albertvillanova HF staff committed on

Remove unnecessary iteration
da4a3b1
verified

albertvillanova HF staff committed on