superceded bloom int8 optimized model doesnt do sampling

#8
by xaq - opened

As the title says. I am glad to have found this model.
https://github.com/huggingface/transformers/issues/19445
Anything but plain "greedy" search "not implemented for 'Half'" #19445

Sign up or log in to comment