runtime error

Exit code: 1. Reason: The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable. Traceback (most recent call last): File "/home/user/app/app.py", line 13, in <module> tokenizer = AutoTokenizer.from_pretrained(model_id, gguf_file=filename) File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/tokenization_auto.py", line 880, in from_pretrained return tokenizer_class_fast.from_pretrained(pretrained_model_name_or_path, *inputs, **kwargs) File "/usr/local/lib/python3.10/site-packages/transformers/tokenization_utils_base.py", line 2073, in from_pretrained raise EnvironmentError( OSError: Can't load tokenizer for 'MaziyarPanahi/Mistral-7B-Instruct-Aya-101-GGUF'. If you were trying to load it from 'https://huggingface.co/models', make sure you don't have a local directory with the same name. Otherwise, make sure 'MaziyarPanahi/Mistral-7B-Instruct-Aya-101-GGUF' is the correct path to a directory containing all relevant files for a LlamaTokenizerFast tokenizer.

Container logs:

Fetching error logs...