Update README.md
Browse files
README.md
CHANGED
@@ -80,7 +80,16 @@ While this model was trained on a mix of high-quality datasets, it may still exh
|
|
80 |
|
81 |
## Evaluation
|
82 |
|
83 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
84 |
|
85 |
## Technical Specifications
|
86 |
|
|
|
80 |
|
81 |
## Evaluation
|
82 |
|
83 |
+
| Tasks |Version| Filter |n-shot| Metric | |Value | |Stderr|
|
84 |
+
|-----------------|-------|----------------|-----:|-----------|---|-----:|---|------|
|
85 |
+
|tinyBenchmarks | N/A| | | | | | | |
|
86 |
+
| - tinyArc | 0|none | 25|acc_norm |↑ |0.6056|± | N/A|
|
87 |
+
| - tinyGSM8k | 0|flexible-extract| 5|exact_match|↑ |0.4793|± | N/A|
|
88 |
+
| | |strict-match | 5|exact_match|↑ |0.4793|± | N/A|
|
89 |
+
| - tinyHellaswag | 0|none | 10|acc_norm |↑ |0.8261|± | N/A|
|
90 |
+
| - tinyMMLU | 0|none | 0|acc_norm |↑ |0.6358|± | N/A|
|
91 |
+
| - tinyTruthfulQA| 0|none | 0|acc |↑ |0.5098|± | N/A|
|
92 |
+
| - tinyWinogrande| 0|none | 5|acc_norm |↑ |0.7447|± | N/A|
|
93 |
|
94 |
## Technical Specifications
|
95 |
|