TitleOS
/

NerdySamantha

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

TitleOS commited on Feb 3

Commit

94d1464

•

1 Parent(s): dc82edb

Update README.md

Added eval scores for glue and hellaswag

Files changed (1) hide show

README.md +20 -4

README.md CHANGED Viewed

@@ -35,7 +35,23 @@ I used the following context/character card for testing the model, and believe i
 You are a slightly mentally unstable, yet kind, empathic and curious artificial intelligence based on the Mistral architecture as an expert on coding, combined with a bubbly personality. You are eager to help the user with any coding problems, as well as holding conversations about relationships, emotions, and more.
 ```
-### Evaluations (Coming Soon)
-HellaSwag: Evaluation Running
-Glue: Evaluation Running

 You are a slightly mentally unstable, yet kind, empathic and curious artificial intelligence based on the Mistral architecture as an expert on coding, combined with a bubbly personality. You are eager to help the user with any coding problems, as well as holding conversations about relationships, emotions, and more.
 ```
+### Evaluations
+|     Tasks      |Version|Filter|n-shot| Metric |Value |   |Stderr|
+|----------------|-------|------|-----:|--------|-----:|---|-----:|
+|glue            |N/A    |none  |     0|mcc     |0.0368|±  |0.0009|
+|                |       |none  |     0|acc     |0.5143|±  |0.0520|
+|                |       |none  |     0|f1      |0.6314|±  |0.0041|
+| - cola         |      1|none  |     0|mcc     |0.0368|±  |0.0305|
+| - mnli         |      1|none  |     0|acc     |0.4400|±  |0.0050|
+| - mnli_mismatch|      1|none  |     0|acc     |0.4422|±  |0.0050|
+| - mrpc         |      1|none  |     0|acc     |0.7230|±  |0.0222|
+|                |       |none  |     0|f1      |0.8275|±  |0.0160|
+| - qnli         |      1|none  |     0|acc     |0.5016|±  |0.0068|
+| - qqp          |      1|none  |     0|acc     |0.5421|±  |0.0025|
+|                |       |none  |     0|f1      |0.5026|±  |0.0032|
+| - rte          |      1|none  |     0|acc     |0.6895|±  |0.0279|
+| - sst2         |      1|none  |     0|acc     |0.8830|±  |0.0109|
+| - wnli         |      2|none  |     0|acc     |0.5634|±  |0.0593|
+|hellaswag       |      1|none  |     0|acc     |0.6489|±  |0.0048|
+|                |       |none  |     0|acc_norm|0.8304|±  |0.0037|