Jarrel Seah
jarrelscy
AI & ML interests
None yet
Organizations
None yet
jarrelscy's activity
I am running in vllm 0.4.1 with 4 x gpus 24gb (A10G 24gb) = 96gb and eager mode and I am still out of memory, how? it should fit (like 87gb vram)
1
#3 opened 7 months ago
by
orel12
KeyError: 'model.layers.45.block_sparse_moe.gate.g_idx'
5
#2 opened 7 months ago
by
tutu329
KeyError: 'model.layers.45.block_sparse_moe.gate.g_idx'
5
#2 opened 7 months ago
by
tutu329