Quant-Cartel
/

L3-70B-Euryale-v2.1-exl2-rpcal

Model card Files Files and versions Community

rAIfle commited on Jun 13

Commit

27c9ab4

•

1 Parent(s): cb26aad

Update README.md

Files changed (1) hide show

README.md +88 -7

README.md CHANGED Viewed

@@ -1,7 +1,88 @@
-bash quant.sh -m "${IN_MODEL}" -t "${HUGGINGFACE_TOKEN}" -r "${OUT_MODEL}" -b 8 -h 8 && \
-bash quant.sh -m "${IN_MODEL}" -t "${HUGGINGFACE_TOKEN}" -r "${OUT_MODEL}" -b 6 -h 6 && \
-bash quant.sh -m "${IN_MODEL}" -t "${HUGGINGFACE_TOKEN}" -r "${OUT_MODEL}" -b 4.65 -h 6 && \
-bash quant.sh -m "${IN_MODEL}" -t "${HUGGINGFACE_TOKEN}" -r "${OUT_MODEL}" -b 4.5 -h 6 && \
-bash quant.sh -m "${IN_MODEL}" -t "${HUGGINGFACE_TOKEN}" -r "${OUT_MODEL}" -b 3.75 -h 6 && \
-bash quant.sh -m "${IN_MODEL}" -t "${HUGGINGFACE_TOKEN}" -r "${OUT_MODEL}" -b 3.5 -h 6 && \
-bash quant.sh -m "${IN_MODEL}" -t "${HUGGINGFACE_TOKEN}" -r "${OUT_MODEL}" -b 2.25 -h 6 && \

+---
+license: cc-by-nc-4.0
+language:
+- en
+---
+```
+  e88 88e                               d8
+ d888 888b  8888 8888  ,"Y88b 888 8e   d88
+C8888 8888D 8888 8888 "8" 888 888 88b d88888
+ Y888 888P  Y888 888P ,ee 888 888 888  888
+  "88 88"    "88 88"  "88 888 888 888  888
+      b
+      8b,
+  e88'Y88                  d8           888
+ d888  'Y  ,"Y88b 888,8,  d88    ,e e,  888
+C8888     "8" 888 888 "  d88888 d88 88b 888
+ Y888  ,d ,ee 888 888     888   888   , 888
+  "88,d88 "88 888 888     888    "YeeP" 888
+PROUDLY PRESENTS
+```
+# L3-70B-Euryale-v2.1-exl2-rpcal
+Quantized using 200 samples of 8192 tokens from an RP-oriented [PIPPA](https://huggingface.co/datasets/royallab/PIPPA-cleaned) dataset.
+Branches:
+- `main` -- `measurement.json`
+- `8b8h` -- 8bpw, 8bit lm_head
+- `6b6h` -- 6bpw, 6bit lm_head
+- `4.65b6h` -- 4.65bpw, 6bit lm_head
+- `4.5b6h` -- 4.5bpw, 6bit lm_head
+- `3.75b6h` -- 3.75bpw, 6bit lm_head
+- `3.5b6h` -- 3.5bpw, 6bit lm_head
+- `2.25b6h` -- 2.25bpw, 6bit lm_head
+Original model link: [Sao10K/L3-70B-Euryale-v2.1](https://huggingface.co/Sao10K/L3-70B-Euryale-v2.1)
+Original model README below.
+-----
+![Euryale](https://images7.alphacoders.com/921/921311.jpg)
+**She's back!**
+Stheno's Sister Model, designed to impress.
+```
+- Same Dataset used as Stheno v3.2 -> See notes there.
+- LoRA Fine-Tune -> FFT is simply too expensive.
+- Trained over 8x H100 SXMs and then some more afterwards.
+```
+**Testing Notes**
+```
+- Better prompt adherence.
+- Better anatomy / spatial awareness.
+- Adapts much better to unique and custom formatting / reply formats.
+- Very creative, lots of unique swipes.
+- Is not restrictive during roleplays.
+- Feels like a big brained version of Stheno.
+```
+*Likely due to it being a 70B model instead of 8B. Similar vibes comparing back in llama 2, where 70B models were simply much more 'aware' in the subtler areas and contexts a smaller model like a 7B or 13B simply were not able to handle.*
+---
+**Recommended Sampler Settings**:
+```
+Temperature - 1.17
+min_p - 0.075
+Repetition Penalty - 1.10
+```
+**SillyTavern Instruct Settings**:
+<br>Context Template: Llama-3-Instruct-Names
+<br>Instruct Presets: [Euryale-v2.1-Llama-3-Instruct](https://huggingface.co/Sao10K/L3-70B-Euryale-v2.1/blob/main/Euryale-v2.1-Llama-3-Instruct.json)
+---
+As per usual, support me here:
+Ko-fi: https://ko-fi.com/sao10k
+```
+Art by wada_kazu / わだかず (pixiv page private?)
+```