---
license: apache-2.0
tags:
- safetensors
- llama
- rp
- roleplay
- sillytavern
language:
- en
---
# #llama-3 #roleplay
GGUF-IQ-Imatrix quants for [Endevor/InfinityRP-v2-8B](https://huggingface.co/Endevor/InfinityRP-v2-8B).
Back at it!
> [!IMPORTANT]
> These quants have been done after the fixes from [llama.cpp/pull/6920](https://github.com/ggerganov/llama.cpp/pull/6920).
> Use **KoboldCpp version 1.64** or higher.
> [!NOTE]
> **Prompt formatting...**
> Alpaca prompt format recommended.
> A safe starting SillyTavern preset can be found [here (simple)](https://huggingface.co/Lewdiculous/Model-Requests/tree/main/data/presets/lewdicu-3.0.2-mistral-0.2).
# Original model information by the author:
![image/png](https://cdn-uploads.huggingface.co/production/uploads/65d4cf2693a0a3744a27536c/V643ZxRyElJidcW3x1AB0.png)
The idea is the same as [InfinityRP v1](https://huggingface.co/Endevor/InfinityRP-v1-7B), but this one is Llama 3 with 16k ctx! Have fun...
### Prompt format: Alpaca.
``"You are now in roleplay chat mode. Engage in an endless chat, always with a creative response. Follow lengths very precisely and create paragraphs accurately. Always wait your turn, next actions and responses. Your internal thoughts are wrapped with ` marks."``
**User Message Prefix = ### Input:**
**Assistant Message Prefix = ### Response:**
**System Message Prefix = ### Instruction:**
**Turn on "Include Names"** (optional)
### Text Length: (use on your System Prompt or ### Response:)
Response: (length = medium) <- [tiny, micro, short, medium, long, enormous, huge, massive, humongous]
### Example:
![example](https://files.catbox.moe/t3hcez.png)