RoPE scaling and max_position_embeddings
Hello,
In config.json, a linear rope_scaling with a factor of 8 is defined, and max_position_embeddings has been increased to 32768.
However, the Hugging Face Llama 2 documentation specifies that when a RoPE scaling strategy is used, max_position_embeddings should not be updated:
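For concreteness, here is a minimal sketch of just those two settings expressed through transformers' LlamaConfig (all other fields omitted; the values are the ones described above):

```python
from transformers import LlamaConfig

# The two settings in question: linear RoPE scaling with factor 8,
# while max_position_embeddings already stores the scaled length.
config = LlamaConfig(
    max_position_embeddings=32768,
    rope_scaling={"type": "linear", "factor": 8.0},
)

print(config.max_position_embeddings)  # 32768
print(config.rope_scaling)             # {'type': 'linear', 'factor': 8.0}
```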
https://huggingface.co/docs/transformers/main/model_doc/llama2#transformers.LlamaConfig.rope_scaling
Wouldn't the existing config result in the RoPE scaling being applied twice (especially when setting trust_remote_code=False)?
If so, this should be fixed.
Hi @ag0, thanks for bringing this up! I think this only affects NTK scaling, not the linear scaling that is used here: https://github.com/huggingface/transformers/blob/fdd81aea12f06e24ab5cf5ba3c7316df3ab1a779/src/transformers/models/llama/modeling_llama.py#L135-L144
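For reference, here is a rough sketch (not the exact library code; dim, base, and the original context length are illustrative Llama-style defaults) of how the two variants compute the rotary frequencies. Only the dynamic NTK path reads max_position_embeddings, which is why an already-enlarged value would only skew that variant:

```python
import torch


def linear_scaled_freqs(seq_len, dim=128, base=10000.0, scaling_factor=8.0):
    # Linear scaling: position indices are divided by a fixed factor.
    # max_position_embeddings is never read, so enlarging it in
    # config.json does not change these frequencies.
    inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2).float() / dim))
    t = torch.arange(seq_len).float() / scaling_factor
    return torch.outer(t, inv_freq)


def dynamic_ntk_scaled_freqs(seq_len, dim=128, base=10000.0,
                             scaling_factor=8.0, max_position_embeddings=4096):
    # Dynamic NTK scaling: the rotary base is rescaled as a function of
    # seq_len / max_position_embeddings, so a config that already stores
    # the scaled length would effectively apply the scaling twice.
    if seq_len > max_position_embeddings:
        base = base * (
            (scaling_factor * seq_len / max_position_embeddings)
            - (scaling_factor - 1)
        ) ** (dim / (dim - 2))
    inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2).float() / dim))
    t = torch.arange(seq_len).float()
    return torch.outer(t, inv_freq)
```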
Let us know what you think! :)