Quantization Performances
#4
by
AutomaticHourglass
- opened
What are the quantization performances? Is it ok to use q8 or we should only use the fp16?
What are the quantization performances? Is it ok to use q8 or we should only use the fp16?