---
base_model: meta-llama/Meta-Llama-3.1-8B
datasets:
- mlabonne/llmtwin
language:
- en
library_name: transformers
license: apache-2.0
tags:
- unsloth
- trl
- sft
---
|
|
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/61b8e2ba285851687028d395/Ddo6O27iJ0uFiGp7Y5py1.png)
|
|
|
# 👥 TwinLlama-3.1-8B
|
|
|
TwinLlama-3.1-8B is a model created for the [LLM Engineer's Handbook](https://a.co/d/9vYzTUC), trained on [mlabonne/llmtwin](https://huggingface.co/datasets/mlabonne/llmtwin).
|
|
|
It is designed to act as a digital twin: an AI clone of me and my co-authors (Paul Iusztin and Alex Vesa) that imitates our writing style and draws on knowledge from our articles.
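Below is a minimal inference sketch using 🤗 Transformers. The repo id and prompt are illustrative assumptions; check the tokenizer and config on the Hub for the exact prompt template used during training.

```python
# Minimal inference sketch (repo id and prompt format are assumptions,
# not the exact setup from the book).
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="mlabonne/TwinLlama-3.1-8B",  # assumed Hub repo id for this card
    device_map="auto",
    torch_dtype="auto",
)

prompt = "Write a paragraph explaining what a digital twin LLM is."
result = generator(prompt, max_new_tokens=256, do_sample=True, temperature=0.7)
print(result[0]["generated_text"])
```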
|
|
|
---
|
|
|
This Llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's [TRL](https://github.com/huggingface/trl) library.
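For reference, fine-tuning with Unsloth and TRL follows the general pattern sketched below. The LoRA settings, hyperparameters, and dataset column name are illustrative placeholders, not the exact values used for this model, and depending on your TRL version some arguments may belong on `SFTConfig` rather than `SFTTrainer`.

```python
# Illustrative SFT sketch with Unsloth + TRL; hyperparameters are placeholders.
from unsloth import FastLanguageModel
from trl import SFTTrainer
from transformers import TrainingArguments
from datasets import load_dataset

# Load the base model in 4-bit with Unsloth's patched loader.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="meta-llama/Meta-Llama-3.1-8B",
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters (rank/alpha here are illustrative values).
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

dataset = load_dataset("mlabonne/llmtwin", split="train")

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    dataset_text_field="text",  # assumes a pre-formatted "text" column
    max_seq_length=2048,
    args=TrainingArguments(
        per_device_train_batch_size=2,
        gradient_accumulation_steps=8,
        num_train_epochs=3,
        learning_rate=2e-4,
        output_dir="output",
    ),
)
trainer.train()
```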
|
|
|
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)