Replete-AI
/

Llama-3-11.5B-Instruct-V2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Llama-3-11.5B-Instruct-V2 / README.md

rombodawg's picture

Update README.md

894ac04 verified 4 months ago

|

No virus

1.72 kB

	---
	license: other
	license_name: llama-3
	license_link: https://llama.meta.com/llama3/license/
	---

	Llama-3-11.5B-Instruct-v2

	Thank you to Meta for the weights for Meta-Llama-3-8B-Instruct

	![image/png](https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/aJJxKus1wP5N-euvHEUq7.png)

	This is an upscaling of the Meta-Llama-3-8B-Instruct Ai using techniques created for chargoddard/mistral-11b-slimorca. This Ai model has been upscaled from 8b parameters to 11.5b parameters without any continuous pretraining or fine-tuning.

	Unlike version 1 this model has no issues at fp16 or any quantizations.

	The model that was used to create this one is linked below:

	https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
	______________________________________________________________________________
	- Llama-3-11.5B-Instruct-V2

	\| Metric \| Value \|
	\|----------------------------------\|-------:\|
	\| Avg. \| 63.91 \|
	\| AI2 Reasoning Challenge (25-Shot)\| 57.68 \|
	\| HellaSwag (10-Shot) \| 78.59 \|
	\| MMLU (5-Shot) \| 67.35 \|
	\| TruthfulQA (0-shot) \| 35.86 \|
	\| Winogrande (5-shot) \| 74.74 \|
	\| GSM8k (5-shot) \| 69.37 \|

	- Original Meta-Llama-3-8B-Instruct

	\| Metric \| Value \|
	\|----------------------------------\|-------:\|
	\| Avg. \| 66.87 \|
	\| AI2 Reasoning Challenge (25-Shot)\| 60.75 \|
	\| HellaSwag (10-Shot) \| 78.55 \|
	\| MMLU (5-Shot) \| 67.07 \|
	\| TruthfulQA (0-shot) \| 51.65 \|
	\| Winogrande (5-shot) \| 74.51 \|
	\| GSM8k (5-shot) \| 68.69 \|