32k or 8k context?
#1
by mclassHF2023 - opened
This shows up in oobabooga as 8192 context, and it also seems to generate gibberish when set to a higher context. I can't test the original model, but is this meant to have genuine 32k context?
From what I can see, the model has a base context size of 8192 and uses RoPE scaling to get to 32k. Anything beyond that you should ask about on the original model's page - this is just a set of quants of it. See the model card for a link.
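For anyone wiring this up themselves, here is a minimal sketch (assuming llama-cpp-python and a placeholder GGUF filename, neither of which comes from this thread) of loading one of the quants with the full 32k window. A recent llama.cpp build reads the RoPE scaling factor from the GGUF metadata, so normally only the context size needs to be raised:

```python
# Sketch only: load a GGUF quant at 32k context with llama-cpp-python.
# The model_path is a placeholder; point it at whichever quant you downloaded.
from llama_cpp import Llama

llm = Llama(
    model_path="model-Q4_K_M.gguf",  # placeholder filename
    n_ctx=32768,                     # request the full 32k window
    # rope_freq_scale=...            # usually unnecessary: the default (0.0)
                                     #   means "use the value stored in the GGUF"
)

out = llm("Summarize the following text:\n...", max_tokens=128)
print(out["choices"][0]["text"])
```

If the output turns to gibberish well below 32k even with an up-to-date build, that points at the frontend overriding the RoPE settings rather than at the quants themselves.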
Just tested with f16 - seems pretty coherent at 32k context size. Make sure your tool is up to date w.r.t. llama 3.
mradermacher changed discussion status to closed