Article Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick By cxdu • 16 days ago • 8
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures Paper • 2410.13754 • Published 23 days ago • 74
Harnessing Webpage UIs for Text-Rich Visual Understanding Paper • 2410.13824 • Published 23 days ago • 29
LongEmbed: Extending Embedding Models for Long Context Retrieval Paper • 2404.12096 • Published Apr 18 • 2
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper • 2309.10400 • Published Sep 19, 2023 • 26