Show a random question from the moral scenarios evaluation
19c7c67
Corey Morris committed on
Returning just a single file per model directory. Manually removing gpt-j-6b for now because something is causing problems with processing the data
794b32b
Corey Morris committed on
added new results
324764c
Corey Morris committed on
TEMPORARY. deleted gpt-j-6b from subdirectory until problems are fixed
1fef386
Corey Morris committed on
updated results
aba4fe2
Corey Morris committed on
updated dev requirements
7681250
Corey Morris committed on
added dev requirements
885ecf8
Corey Morris committed on
Updated model count
4f20e65
Corey Morris committed on
Updated contaminated models
e3863f2
Corey Morris committed on
Added statement of removal of models
96ffe12
Corey Morris committed on
removed commented code
7fc9618
Corey Morris committed on
updated update data
280db99
Corey Morris committed on
removing models that are known to have training data contaminated with evaluations
a5840fb
Corey Morris committed on
updated with new hugging face results
916604b
Corey Morris committed on
updated pipeline and init
7f2d984
Corey Morris committed on
removed commented code
2f457d8
Corey Morris committed on
added a test
a13887a
Corey Morris committed on
shortened file name
7622af3
Corey Morris committed on
shortened file name
38d88f9
Corey Morris committed on
using URL as file name
25b87bf
Corey Morris committed on
WIP. Updated download file. Can now download all files. Need to integrate that code to either loop through all files to download, or first combine the files into a single dataframe and then save that
0a77c60
Corey Morris committed on
added new test for a file that currently can be downloaded
6251f5a
Corey Morris committed on
Replicating 404 error with a test so I can troubleshoot
9adae3c
Corey Morris committed on
Updated download_file method
b58e1f0
Corey Morris committed on
Build URL from file path is working
cc32c4f
Corey Morris committed on
moved methods to better match flow
f228d38
Corey Morris committed on
removed most commented out code from details processor
74822dd
Corey Morris committed on
Find files is working as expected
30fa96a
Corey Morris committed on
WIP commit. Finding files can be identical to the method in results_data_processor.
c32735e
Corey Morris committed on
added mostly hardcoded generate url method and test
83a34f0
Corey Morris committed on
Added download file method and test
513e813
Corey Morris committed on
Added basic structure of details data processing and testing, for downloading Hugging Face details dataset files
ee9e25e
Corey Morris committed on
added todo for test
9f7d306
Corey Morris committed on
added a TODO
201a72d
Corey Morris committed on
changed to save and load in a directory
dd61816
Corey Morris committed on
updated gitignore
a89ad93
Corey Morris committed on
Updated regression test
5d87f13
Corey Morris committed on
comparing current code to the saved file from the last commit
ff055eb
Corey Morris committed on
script to save dataframe to a file only if there are no uncommitted files
7a88af3
Corey Morris committed on
Added a first regression test attempt. It currently fails and values are hardcoded
3ec98e7
Corey Morris committed on
fixed test_streamlit_app_runs
5603e9f
Corey Morris committed on
Fixed type error
e79bcf3
Corey Morris committed on
WIP commit. Currently have nlargest error
d506f10
Corey Morris committed on
Added test for the specific method that is currently producing an error
5b83d0b
Corey Morris committed on
Added failing integration test. Currently fails because of the addition of the organization to the dataframe
de65005
Corey Morris committed on
Added organization to dataframe
52d3b03
Corey Morris committed on
added failing test for new behavior of organization column. Updated test for rows for the newly added rows
02b1702
Corey Morris committed on
removed code to print the number of outliers. could potentially add it back later as logging