starble-dev committed
Commit 9af06e7 • 1 Parent(s): 7635710
Update README.md
README.md CHANGED
````diff
@@ -4,7 +4,7 @@ tags:
 - mistral
 - conversational
 - text-generation-inference
-base_model:
+base_model: BeaverAI/mistral-dory-12b
 library_name: transformers
 ---
 
@@ -12,11 +12,8 @@ library_name: transformers
 > **Sampling:**<br>
 > Mistral-Nemo-12B-Instruct-2407 is very sensitive to the temperature sampler; try values near **0.3**, or else you will get some weird results. This is mentioned by MistralAI in the [Transformers](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407#transformers) section.
 
-**Changes:**
-This model is the original Mistral-Nemo-Instruct-2407 converted to GGUF and quantized using **llama.cpp**.
-
 **How to Use:**
-As of July
+As of July 20, 2024, llama.cpp does not support Mistral-Nemo-Instruct-2407. However, you can still use it by building from iamlemec's **mistral-nemo** branch of the [llama.cpp GitHub repository](https://github.com/iamlemec/llama.cpp/tree/mistral-nemo).
 
 ```
 git clone -b mistral-nemo https://github.com/iamlemec/llama.cpp.git
@@ -45,22 +42,10 @@ If you need more clarification on parameters check out the [llama.cpp Server Doc
 Apache 2.0
 
 **Original Model:**
-[
-
-**Note:**
-MistralAI does not have any affiliation with starble-dev.
+[BeaverAI/mistral-dory-12b](https://huggingface.co/BeaverAI/mistral-dory-12b)
 
 # Quants
-
-| ---- | ---- | ---- |
-| [Mistral-Nemo-12B-Instruct-2407-Q2_K.gguf](https://huggingface.co/starble-dev/Mistral-Nemo-12B-Instruct-2407-GGUF/blob/main/Mistral-Nemo-12B-Instruct-2407-Q2_K.gguf) | Q2_K | 4.79 GB |
-| [Mistral-Nemo-12B-Instruct-2407-Q3_K.gguf](https://huggingface.co/starble-dev/Mistral-Nemo-12B-Instruct-2407-GGUF/blob/main/Mistral-Nemo-12B-Instruct-2407-Q3_K.gguf) | Q3_K | 6.08 GB |
-| [Mistral-Nemo-12B-Instruct-2407-Q4_K_S.gguf](https://huggingface.co/starble-dev/Mistral-Nemo-12B-Instruct-2407-GGUF/blob/main/Mistral-Nemo-12B-Instruct-2407-Q4_K_S.gguf) | Q4_K_S | 7.12 GB |
-| [Mistral-Nemo-12B-Instruct-2407-Q4_K_M.gguf](https://huggingface.co/starble-dev/Mistral-Nemo-12B-Instruct-2407-GGUF/blob/main/Mistral-Nemo-12B-Instruct-2407-Q4_K_M.gguf) | Q4_K_M | 7.48 GB |
-| [Mistral-Nemo-12B-Instruct-2407-Q5_K_M.gguf](https://huggingface.co/starble-dev/Mistral-Nemo-12B-Instruct-2407-GGUF/blob/main/Mistral-Nemo-12B-Instruct-2407-Q5_K_M.gguf) | Q5_K_M | 8.73 GB |
-| [Mistral-Nemo-12B-Instruct-2407-Q6_K.gguf](https://huggingface.co/starble-dev/Mistral-Nemo-12B-Instruct-2407-GGUF/blob/main/Mistral-Nemo-12B-Instruct-2407-Q6_K.gguf) | Q6_K | 10.1 GB |
-| [Mistral-Nemo-12B-Instruct-2407-Q8_0.gguf](https://huggingface.co/starble-dev/Mistral-Nemo-12B-Instruct-2407-GGUF/blob/main/Mistral-Nemo-12B-Instruct-2407-Q8_0.gguf) | Q8_0 | 13 GB |
-| [Mistral-Nemo-12B-Instruct-2407-Q8_0_L.gguf](https://huggingface.co/starble-dev/Mistral-Nemo-12B-Instruct-2407-GGUF/blob/main/Mistral-Nemo-12B-Instruct-2407-Q8_0_L.gguf) | Q8_0 | 13.7 GB |
+Quantizing currently
 
 > [!NOTE]
 > **Note: Q8_0_L**<br>
````
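The second hunk ends at the `git clone` line; the rest of the README's build instructions fall outside the diff context. As a minimal sketch of completing the build, using the standard llama.cpp CMake steps (not taken from this README):

```bash
# Clone the branch named in the README and build llama.cpp from source.
git clone -b mistral-nemo https://github.com/iamlemec/llama.cpp.git
cd llama.cpp

# Standard CMake build; enable GPU backends per the llama.cpp docs for your version.
cmake -B build
cmake --build build --config Release -j

# The resulting binaries (llama-cli, llama-server, llama-quantize) land in build/bin/.
```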
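The Sampling note recommends temperatures near 0.3, and the context around the third hunk points to the llama.cpp server docs for parameter details. A rough usage sketch, assuming the build above; the GGUF filename is illustrative (taken from the removed quant table):

```bash
# Quick local test with llama-cli; --temp 0.3 follows the Sampling note.
./build/bin/llama-cli -m Mistral-Nemo-12B-Instruct-2407-Q4_K_M.gguf --temp 0.3 \
  -p "Hello, how are you?"

# Or serve an OpenAI-compatible HTTP API (run in a separate terminal),
# then set the temperature per request in the JSON payload.
./build/bin/llama-server -m Mistral-Nemo-12B-Instruct-2407-Q4_K_M.gguf -c 8192 --port 8080
curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" \
  -d '{"messages":[{"role":"user","content":"Hello"}],"temperature":0.3}'
```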