
InfinityKuno-2x7B

GGUF-Imatrix quantizations of InfinityKuno-2x7B

Experimental model built from Endevor/InfinityRP-v1-7B and SanjiWatsuki/Kunoichi-DPO-v2-7B, merged into a Mixture-of-Experts (MoE) model with 2x7B parameters.

Perplexity

Measured with llama.cpp's perplexity tool on a private roleplay dataset.

| Format | PPL |
|--------|-----|
| FP16 | 3.2686 +/- 0.12496 |
| Q8_0 | 3.2738 +/- 0.12570 |
| Q5_K_M | 3.2589 +/- 0.12430 |
| IQ4_NL | 3.2689 +/- 0.12487 |
| IQ3_M | 3.3097 +/- 0.12233 |
| IQ2_M | 3.4658 +/- 0.13077 |
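For reference, perplexity is the exponential of the average negative log-likelihood per token, which is what llama.cpp reports above. A minimal sketch of that definition (the log-probability values here are hypothetical, not taken from this model):

```python
import math

def perplexity(token_logprobs):
    # PPL = exp(-(1/N) * sum of natural-log token probabilities)
    n = len(token_logprobs)
    return math.exp(-sum(token_logprobs) / n)

# Hypothetical per-token log-probabilities from a model run
logprobs = [-1.2, -0.8, -1.5, -1.0]
print(perplexity(logprobs))  # lower is better
```

Lower PPL means the model assigns higher probability to the held-out text, which is why the quantized rows above are compared against the FP16 baseline.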

Prompt format:

Alpaca, Extended Alpaca, Roleplay-Alpaca. (Any Alpaca-based prompt formatting should work fine.)
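As a sketch, an Alpaca-style prompt uses `### Instruction:` / `### Response:` headers; the builder below uses the standard Alpaca preamble, and the sample instruction is just a hypothetical example:

```python
def alpaca_prompt(instruction: str, response: str = "") -> str:
    # Standard Alpaca layout: preamble, instruction block, response block.
    # Leaving `response` empty yields a generation prompt for the model.
    return (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request.\n\n"
        f"### Instruction:\n{instruction}\n\n"
        f"### Response:\n{response}"
    )

print(alpaca_prompt("Introduce yourself."))
```

Extended and roleplay variants typically add extra sections (e.g. character or system context) before the instruction block, but keep the same header style.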


GGUF metadata:

- Model size: 12.9B params
- Architecture: llama
- Quantizations available: 4-bit, 5-bit, 8-bit
