Update README.md
README.md CHANGED
@@ -23,6 +23,10 @@ base_model: deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
 **Original model**: [DeepSeek-Coder-V2-Lite-Instruct](https://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct)<br>
 **GGUF quantization:** provided by [bartowski](https://huggingface.co/bartowski) based on `llama.cpp` release [b3166](https://github.com/ggerganov/llama.cpp/releases/tag/b3166)<br>
 
+## Model Settings:
+
+Flash attention MUST be **disabled** for this model to work.
+
 ## Model Summary:
 
 This is a brand new Mixture of Experts (MoE) model from DeepSeek, specializing in coding instructions.<br>
@@ -42,6 +46,7 @@ This will format the prompt as follows:
 User: {user_message}
 
 Assistant: {assistant_message}
+```
 
 ## Technical Details
 
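The second hunk closes the README's prompt-format block, which lays turns out as plain `User:` / `Assistant:` text. A minimal sketch of filling that template in (the example message is invented; an actual runtime may build the prompt from the model's chat template instead):

```python
# Fill in the User/Assistant prompt layout shown in the README.
user_message = "Write a function that reverses a string."
prompt = f"User: {user_message}\n\nAssistant: "
print(prompt)
```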
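The first hunk's new `## Model Settings:` note says flash attention has to stay off. A minimal sketch of respecting that when loading the GGUF, assuming llama-cpp-python (whose `Llama` constructor exposes a `flash_attn` flag); the file name and context size below are hypothetical:

```python
from llama_cpp import Llama

# Load the quantized model with flash attention explicitly disabled,
# per the Model Settings note added in this commit.
llm = Llama(
    model_path="DeepSeek-Coder-V2-Lite-Instruct-Q4_K_M.gguf",  # hypothetical local file name
    n_ctx=4096,                                                # hypothetical context size
    flash_attn=False,                                          # flash attention must be disabled
)

# Query it with the prompt layout documented in the README.
out = llm("User: Write a hello world program in C.\n\nAssistant: ", max_tokens=128)
print(out["choices"][0]["text"])
```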