flash_attn requirement prevents loading on macOS
#6 by bghira - opened
I'm running an M3 Max with 128GB of unified memory, and I cannot load the pipeline because flash_attn does not work on Apple MPS.
I will be updating the model shortly with the official Gemma bugfixes, so let's see if that works.
shortly? :)
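In the meantime, here is a rough sketch of how a loader could avoid asking for flash_attn on machines where it cannot run. This is not this repo's code: `pick_attn_implementation` is a hypothetical helper, and it only helps if the model's own code does not import flash_attn unconditionally, which is an assumption I have not verified here.

```python
import importlib.util
import platform


def pick_attn_implementation() -> str:
    """Hypothetical helper: choose an attention backend that can load on this machine."""
    # flash_attn relies on CUDA kernels, so treat macOS (Apple MPS) as unsupported
    # even if a package with that name happens to be installed.
    on_macos = platform.system() == "Darwin"
    has_flash_attn = importlib.util.find_spec("flash_attn") is not None
    if has_flash_attn and not on_macos:
        return "flash_attention_2"
    # "sdpa" selects PyTorch's built-in scaled dot-product attention.
    return "sdpa"


if __name__ == "__main__":
    print(pick_attn_implementation())
```

The returned string is meant to be passed as the `attn_implementation` argument of `from_pretrained` in transformers. Whether this particular model accepts `"sdpa"` is also an assumption; `"eager"` would be the next thing to try.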