superceded bloom int8 optimized model doesnt do sampling
#8
by
xaq
- opened
As the title says. I am glad to have found this model.
https://github.com/huggingface/transformers/issues/19445
Anything but plain "greedy" search "not implemented for 'Half'" #19445