Fix the kv-cache dimensions

#47

by cchudant - opened Jun 7, 2023

base: refs/heads/main ← from: refs/pr/47


cchudant

Jun 7, 2023

Hello!
I noticed that the kv-cache dimensions here look odd and do not match the Hugging Face transformers modeling_bloom.py file.
Is the departure from the Bloom dimensions intended?
Judging from the copy-pasted comments, it looks like a bug. Also, _convert_to_rw_cache and its _convert_to_standard_cache counterpart match the Bloom dimensions.

Fix the kv-cache dimensions (d5ff350e)

cchudant

Jun 7, 2023

Upstream modeling_bloom.py: https://github.com/huggingface/transformers/blob/fabe17a726bbf6081cfbcc975d8ac451a81f3e2d/src/transformers/models/bloom/modeling_bloom.py#L305
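
For reference, here is a minimal sketch of the cache layout that the comments in the linked modeling_bloom.py describe: keys are stored as [batch_size * num_heads, head_dim, kv_length] and values as [batch_size * num_heads, kv_length, head_dim], so the two are concatenated along different axes. This is an illustration only. It uses NumPy arrays as stand-ins for torch tensors, and the sizes are arbitrary values I picked, not taken from this repository.

```python
import numpy as np

# Arbitrary illustration sizes (assumptions, not from the model config).
batch_size, num_heads, head_dim = 2, 4, 8
past_len, new_len = 5, 1

# Bloom-style cache layout, as described in the modeling_bloom.py comments:
#   key:   [batch_size * num_heads, head_dim, kv_length]
#   value: [batch_size * num_heads, kv_length, head_dim]
past_key = np.zeros((batch_size * num_heads, head_dim, past_len))
past_value = np.zeros((batch_size * num_heads, past_len, head_dim))

# Key/value for the newly generated token(s), in the same layout.
new_key = np.ones((batch_size * num_heads, head_dim, new_len))
new_value = np.ones((batch_size * num_heads, new_len, head_dim))

# The sequence axis is the last axis for keys and the middle axis for values.
key = np.concatenate((past_key, new_key), axis=2)
value = np.concatenate((past_value, new_value), axis=1)

print(key.shape)    # (8, 8, 6)
print(value.shape)  # (8, 6, 8)
```

If the cache were kept in some other layout, these concatenation axes (and the shape comments next to them) would have to change together, which is why the mismatch with the copied comments looks like a bug.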


Ready to merge

This branch is ready to get merged automatically.
