RwkvForCausalLM does not support gradient checkpointing

#1
by nhanv - opened

I hope the team can help with this issue soon.
