tokenizer.model_max_length for llama-2-7b-chat-hf
Thanks for this model.
When printing 'tokenizer.model_max_length', I got a number like '1000000000000000019884624838656'.
Isn't model_max_length supposed to be 4k? Not sure where this behavior stems from.
Thanks
That number is actually correct, because we solved long context.
No, just kidding, I don't know either. Have you tried the same command with Meta's version of the Llama-2 weights?
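Roughly what I mean, as a minimal sketch (assumes you can access the meta-llama/Llama-2-7b-chat-hf repo; model_max_length is a standard tokenizer kwarg, so you can also just pin it yourself):

```python
# Minimal sketch, assuming access to the meta-llama/Llama-2-7b-chat-hf repo.
# If model_max_length is missing from tokenizer_config.json, transformers falls
# back to a very large sentinel value, which is what that 1e30-ish number is.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-chat-hf")
print(tokenizer.model_max_length)  # huge sentinel if the config omits it

# Pin the context length you expect (4096 for Llama-2) when loading:
tokenizer = AutoTokenizer.from_pretrained(
    "meta-llama/Llama-2-7b-chat-hf",
    model_max_length=4096,
)
print(tokenizer.model_max_length)  # 4096
```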
Hello, I'm using this model, but since yesterday, when I run it, I'm getting this error. Running on a 4090.
Traceback (most recent call last):
  File "/root/endpoint.py", line 43, in chat
    response = miner.forward(messages, num_replies = n)
  File "/root/endpoint.py", line 106, in forward
    output = self.model.generate(
  File "/opt/conda/lib/python3.10/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "/opt/conda/lib/python3.10/site-packages/transformers/generation/utils.py", line 1485, in generate
    return self.sample(
  File "/opt/conda/lib/python3.10/site-packages/transformers/generation/utils.py", line 2560, in sample
    next_tokens = torch.multinomial(probs, num_samples=1).squeeze(1)
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0
Thank you for your help in advance.
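In case it helps, this inf/nan error in torch.multinomial is often traced back to half-precision overflow in the logits or to extreme sampling settings. Here is a minimal sketch of a generation call to isolate the problem (not a guaranteed fix; it assumes the hub id below is the model this thread is about and that accelerate is installed for device_map="auto"):

```python
# Sketch: load in bfloat16 (well supported on a 4090) with moderate sampling
# parameters, then try a short generation. If this runs cleanly, the issue is
# more likely in the original dtype or generation settings than in the weights.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-chat-hf"  # assumed hub id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # avoids common fp16 overflow issues
    device_map="auto",
)

inputs = tokenizer("Hello, how are you?", return_tensors="pt").to(model.device)
output = model.generate(
    **inputs,
    do_sample=True,
    temperature=0.7,
    top_p=0.9,
    max_new_tokens=64,
)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```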