Half precision

by ljhwild - opened Jun 20

Discussion

ljhwild

Jun 20

Is this compatible out of the box with half precision or quantizations as opposed to unbables library implementation?

vince62s

Owner Jun 20

as you can see the model size is 7GB which for a 3.5G params is FP16.
but you can achive the same with the Unbabel model by changing two lines of code.

vince62s changed discussion status to closed Jun 20

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment