xDAN2099 commited on
Commit
3629e1f
1 Parent(s): b31f896

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -6
README.md CHANGED
@@ -31,12 +31,12 @@ The following specifications:
31
  - **Maximum Sequence Length (context):** 32,768 tokens
32
  ## Usage
33
 
34
- | Model | Quantized | Size | Hardware Requirement |
35
- |-------------|-----------|--------|--------------------------|
36
- | APUS-xDAN4.0-MoE-0402.Q2_K.gguf | Q2_K | 39G | 2x24G GPU memory |
37
- | APUS-xDAN4.0-MoE-0402.Q3_K_M_Matrix.gguf | Q3_K_M | 51G | 2x24G GPU memory |
38
- | Large | No | Large | 3x24G GPU memory |
39
- | APUS-xDAN4.0-MoE-0402.Q4_K_M.gguf | Q4_K_M | 64G | 3x24G GPU memory |
40
 
41
 
42
 
 
31
  - **Maximum Sequence Length (context):** 32,768 tokens
32
  ## Usage
33
 
34
+ | Model | Quantized | Size | Context | Hardware Requirement |
35
+ |-------------|-----------|--------|--------------------------| --------------------------|
36
+ | APUS-xDAN4.0-MoE-0402.Q2_K.gguf | Q2_K | 39G | 32k | 2x24G GPU memory |
37
+ | APUS-xDAN4.0-MoE-0402.Q3_K_M_Matrix.gguf | Q3_K_M | 51G | 32k | 2x24G GPU memory |
38
+ | Large | No | Large | 32k | 2x24G GPU memory |
39
+ | APUS-xDAN4.0-MoE-0402.Q4_K_M.gguf | Q4_K_M | 64G | 32k | 2x24G GPU memory |
40
 
41
 
42