Update README.md
README.md
CHANGED
@@ -57,7 +57,7 @@ Our average performance for BigBench-Hard: 0.488
 Average for AGIEval: 0.447
 
 In the Orca paper, they measured their score relative to Vicuna on these evals.
-We have done the same and have found our score averages to **~103%** of the total
+We have done the same and have found our score averages to **~103%** of the total performance that was shown in the Orca paper, using the same evaluation methods as outlined in the paper.
 
 So we are surpassing Orca performance with <20% of the dataset size and <1/10th the training budget!
 