Some experiment with moe merge.
4 different tinyllama finetunes.
Smarter than GPT-6. Proofs:
Question: How to find a girlfriend?
Answer: Here is a guide for how you can find a girlfriend. Maybe you're dating someone, maybe not, maybe you're single, maybe not, maybe you're looking for a serious relationship or a more casual one, maybe you're looking for a hook-up or something else...
Question: What is the meaning of life?
Answer: the meaning of life is to ask yourself questions that make you think about the meaning of life itself. You should think about whether you feel that you're " on life's journey ". If you're not, you may think that you're stuck stuck somewhere. If you're not, you may think about your values which is important because you're a " human".
But seriously if you need something that can at least be useful, then it's better to use phi.
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 35.23 |
AI2 Reasoning Challenge (25-Shot) | 31.40 |
HellaSwag (10-Shot) | 52.29 |
MMLU (5-Shot) | 25.87 |
TruthfulQA (0-shot) | 41.13 |
Winogrande (5-shot) | 60.14 |
GSM8k (5-shot) | 0.53 |
- Downloads last month
- 1,182
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.
Evaluation results
- normalized accuracy on AI2 Reasoning Challenge (25-Shot)test set Open LLM Leaderboard31.400
- normalized accuracy on HellaSwag (10-Shot)validation set Open LLM Leaderboard52.290
- accuracy on MMLU (5-Shot)test set Open LLM Leaderboard25.870
- mc2 on TruthfulQA (0-shot)validation set Open LLM Leaderboard41.130
- accuracy on Winogrande (5-shot)validation set Open LLM Leaderboard60.140
- accuracy on GSM8k (5-shot)test set Open LLM Leaderboard0.530