Other than these graphs, which are not quite up to scratch with details if we're being honest:
We have conducted mmlu-pro evaluation of the model, here are our results. I hope you find this useful.
· Sign up or log in to comment