DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads • Paper • arXiv:2410.10819 • Published October 2024