Casual-Autopsy committed
Commit fd9439c
1 parent: 7b66b68

Update README.md

Files changed (1): README.md (+18 -14)

README.md CHANGED
@@ -177,6 +177,23 @@ The following models were included in the merge:
 * [Nitral-AI/Hathor_Stable-v0.2-L3-8B](https://huggingface.co/Nitral-AI/Hathor_Stable-v0.2-L3-8B)
 * [Sao10K/L3-8B-Stheno-v3.1](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.1)
 
+## [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Casual-Autopsy__L3-Umbral-Mind-RP-v2.0-8B)
+
+**Explanation for AI RP newbies:** IFEval is the most important evaluation for RP AIs, since it measures how well the model can follow OOC instructions, Lorebooks, and, most importantly, character cards.
+The rest don't matter. At least not nearly as much as IFEval.
+
+| Metric |Value|
+|-------------------|----:|
+|Avg. |25.76|
+|IFEval (0-Shot) |71.23|
+|BBH (3-Shot) |32.49|
+|MATH Lvl 5 (4-Shot)|10.12|
+|GPQA (0-shot) | 4.92|
+|MuSR (0-shot) | 5.55|
+|MMLU-PRO (5-shot) |30.26|
+
 ## Secret Sauce
 
 The following YAML configurations were used to produce this model:
@@ -319,17 +336,4 @@ models:
 merge_method: task_arithmetic
 base_model: Casual-Autopsy/Umbral-Mind-3
 dtype: bfloat16
-```
-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
-Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Casual-Autopsy__L3-Umbral-Mind-RP-v2.0-8B)
-
-| Metric |Value|
-|-------------------|----:|
-|Avg. |25.76|
-|IFEval (0-Shot) |71.23|
-|BBH (3-Shot) |32.49|
-|MATH Lvl 5 (4-Shot)|10.12|
-|GPQA (0-shot) | 4.92|
-|MuSR (0-shot) | 5.55|
-|MMLU-PRO (5-shot) |30.26|
-
+```
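For context, the `merge_method` / `base_model` / `dtype` lines in the second hunk are the tail of a mergekit configuration; the `models:` list itself is truncated out of this hunk. A minimal sketch of what a complete task_arithmetic config of this shape could look like — only `merge_method`, `base_model`, and `dtype` come from the diff, while the model entries and weights below are hypothetical placeholders (the model names are taken from the "models included in the merge" list, not from the actual YAML):

```yaml
# Sketch of a full mergekit task_arithmetic config (assumed structure).
# Only merge_method, base_model, and dtype appear in the diff above;
# the model entries and their weights are hypothetical placeholders.
models:
  - model: Sao10K/L3-8B-Stheno-v3.1            # listed among the merged models
    parameters:
      weight: 0.5                              # hypothetical weight
  - model: Nitral-AI/Hathor_Stable-v0.2-L3-8B  # listed among the merged models
    parameters:
      weight: 0.5                              # hypothetical weight
merge_method: task_arithmetic
base_model: Casual-Autopsy/Umbral-Mind-3
dtype: bfloat16
```

With mergekit installed, a config like this is typically run with `mergekit-yaml config.yml ./output-dir`.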