Problem with Textgeneration
#3
by doomgrave - opened
Hi. I'm using Textgeneration + llama.cpp + the OpenAI API.
Whatever settings I use, I get this error:
"This model's maximum context length is 2048 tokens. However, your messages resulted in over 1021 tokens and max_tokens is 2048."
But the model should support a 4096-token context. Do I need to set some special parameter?
n_ctx is already set to 4096.
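For reference, here's a minimal sketch of how I'm calling the endpoint (the port, model name, and prompt are just placeholders, and I'm assuming the newer openai Python client pointed at the local OpenAI-compatible server):

```python
from openai import OpenAI

# Placeholder base_url/api_key -- the local server ignores the key.
client = OpenAI(base_url="http://127.0.0.1:5000/v1", api_key="sk-dummy")

resp = client.chat.completions.create(
    model="local-model",  # placeholder name
    messages=[{"role": "user", "content": "Hello"}],
    max_tokens=2048,      # the value the error message complains about
)
print(resp.choices[0].message.content)
```

With roughly 1021 prompt tokens plus max_tokens=2048, the request exceeds the 2048-token context the server reports, even though n_ctx is 4096.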