Edit model card

Buy Me A Coffee

This is an experimental merge of models RedPajama-INCITE-Chat-3B-V1 and RedPajama-INCITE-Instruct-3B-V1.
This model is adaptive to prompt templates, but this template is recommended:

HUMAN: {prompt}
ASSISTANT:

Feel free to change HUMAN or ASSISTANT. It will not change much.
GGML versions here (Note that this is only compatible with koboldcpp).

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 39.23
ARC (25-shot) 42.58
HellaSwag (10-shot) 67.48
MMLU (5-shot) 25.99
TruthfulQA (0-shot) 33.62
Winogrande (5-shot) 64.8
GSM8K (5-shot) 0.91

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 39.23
AI2 Reasoning Challenge (25-Shot) 42.58
HellaSwag (10-Shot) 67.48
MMLU (5-Shot) 25.99
TruthfulQA (0-shot) 33.62
Winogrande (5-shot) 64.80
GSM8k (5-shot) 0.91
Downloads last month
1,478
Safetensors
Model size
2.78B params
Tensor type
FP16
Β·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for acrastt/RedPajama-INCITE-Chat-Instruct-3B-V1

Quantizations
2 models

Datasets used to train acrastt/RedPajama-INCITE-Chat-Instruct-3B-V1

Spaces using acrastt/RedPajama-INCITE-Chat-Instruct-3B-V1 23

Evaluation results