yuyijiong's picture

yuyijiong

yuyijiong

·

yuyijiong

AI & ML interests

NLP, sentiment analyze, long context model

Organizations

yuyijiong's activity

upvoted a paper about 1 month ago

Hyper-multi-step: The Truth Behind Difficult Long-context Tasks

Paper • 2410.04422 • Published Oct 6 • 7

upvoted a collection 3 months ago

Qwen2-Math

Math-specific model series based on Qwen2 • 8 items • Updated Sep 18 • 45

upvoted 2 collections 4 months ago

MAmmoTH

The datasets and models for the MAmmoTH project • 9 items • Updated Apr 19 • 2

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 192

upvoted a paper 4 months ago

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

Paper • 2407.02490 • Published Jul 2 • 23