ValueError: `rope_scaling` must be a dictionary with two fields
When deploying my fine-tuned model on a dedicated inference endpoint on Hugging Face, this gets triggered:

ValueError: `rope_scaling` must be a dictionary with with two fields, `type` and `factor`, got {'factor': 8.0, 'low_freq_factor': 1.0, 'high_freq_factor': 4.0, 'original_max_position_embeddings': 8192, 'rope_type': 'llama3'}
Application startup failed. Exiting.
I can't access the config.json file since it's in the base model's repo, so there is no way to modify these values or to upgrade transformers (since it's deploying on Hugging Face's servers).
Any idea what to do?
Same issue!
Same here
Same here. Tried Llama 3.1 and it doesn't work.
pip install transformers -U
It works for me.
@yoo how can you run that on a dedicated inference endpoint when the endpoint cannot even be deployed?
Has anyone found a solution? I am trying to deploy using Text Generation Inference.
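The error comes from an older transformers release that only accepts the legacy two-field `rope_scaling` dict, while Llama 3.1 configs use the newer `rope_type: llama3` format. If you can't upgrade transformers on the endpoint, one workaround (an assumption on my part, not an official fix) is to upload a copy of the model to your own repo with `config.json` patched back to the legacy format. A minimal sketch of the patch step — the choice of `"dynamic"` as the fallback type is a guess, and dropping the llama3-specific fields may hurt quality at long context:

```python
import json

def patch_rope_scaling(config):
    """Collapse a Llama 3.1 style rope_scaling entry into the legacy
    {"type": ..., "factor": ...} form that older transformers releases
    validate against. The llama3-specific fields are discarded, so
    long-context behaviour may not match the original model."""
    rs = config.get("rope_scaling")
    if rs and "rope_type" in rs:
        config["rope_scaling"] = {"type": "dynamic", "factor": rs["factor"]}
    return config

# The rope_scaling dict from the error message above:
cfg = {"rope_scaling": {"factor": 8.0, "low_freq_factor": 1.0,
                        "high_freq_factor": 4.0,
                        "original_max_position_embeddings": 8192,
                        "rope_type": "llama3"}}
print(json.dumps(patch_rope_scaling(cfg)["rope_scaling"]))
# prints {"type": "dynamic", "factor": 8.0}
```

You would run this against a downloaded `config.json`, then push the patched file to a repo you control and point the endpoint at that repo. If the endpoint image lets you control dependencies at all, upgrading transformers is the cleaner fix, since newer releases understand the llama3 `rope_type` natively.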