What hardware do I need for reasonable performance?
#3
by
TS0001
- opened
I think the issue is AutoGPTQ which is slow, but I don't know enough about it, only what I've been reading people say.
I get ~2 t/s on my 3090 with this model which I consider reasonable for the setup (WSL2). :)
What is the fastest way to run this model on GPU?