Casual-Autopsy committed on
Commit
e092060
1 Parent(s): 19de066

Update README.md

Files changed (1)
  1. README.md +34 -0
README.md CHANGED
@@ -103,6 +103,40 @@ The following models were used to make this merge:
103
  * [lighteternal/Llama3-merge-biomed-8b](https://huggingface.co/lighteternal/Llama3-merge-biomed-8b)
104
  * [Casual-Autopsy/Llama3-merge-psychotherapy-8b](https://huggingface.co/Casual-Autopsy/Llama3-merge-psychotherapy-8b)
105
 
106
+ ***
107
+ ## Evaluation Results
108
+
109
+ ### [Open LLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
110
+
111
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Casual-Autopsy__L3-Umbral-Mind-RP-v2.0-8B)
112
+
113
+ **Explanation for AI RP newbies:** IFEval is the most important evaluation for RP models, since it indicates how well a model can follow OOC instructions, lorebooks, and most importantly character cards.
114
+ The rest don't matter nearly as much as IFEval.
115
+
116
+ |Metric | Value|
117
+ |:------------------|------:|
118
+ |Avg. |N/A|
119
+ |IFEval (0-Shot) |N/A|
120
+ |BBH (3-Shot) |N/A|
121
+ |MATH Lvl 5 (4-Shot)|N/A|
122
+ |GPQA (0-shot) |N/A|
123
+ |MuSR (0-shot) |N/A|
124
+ |MMLU-PRO (5-shot) |N/A|
125
+
126
+ ### [UGI Leaderboard](https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard)
127
+
128
+ Information about the metrics can be found at the bottom of the [UGI Leaderboard](https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard) in the respective tabs.
129
+
130
+ |Metric (UGI-Leaderboard) | Value | Value | Metric (Writing Style)|
131
+ |:------------------------|:-----:|:-----:|----------------------:|
132
+ |UGI(Avg.) |N/A|N/A|RegV1 |
133
+ |W/10 |N/A|N/A|RegV2 |
134
+ |Unruly |N/A|N/A|MyScore |
135
+ |Internet |N/A|N/A|ASSS |
136
+ |Stats |N/A|N/A|SMOG |
137
+ |Writing |N/A|N/A|Yule |
138
+ |PolContro |N/A| | |
139
+
140
  ***
141
  ## Secret Sauce
142