-
Can Large Language Models Understand Context?
Paper ā¢ 2402.00858 ā¢ Published ā¢ 21 -
OLMo: Accelerating the Science of Language Models
Paper ā¢ 2402.00838 ā¢ Published ā¢ 80 -
Self-Rewarding Language Models
Paper ā¢ 2401.10020 ā¢ Published ā¢ 143 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper ā¢ 2401.17072 ā¢ Published ā¢ 25
Collections
Discover the best community collections!
Collections including paper arxiv:2402.08268
-
World Model on Million-Length Video And Language With RingAttention
Paper ā¢ 2402.08268 ā¢ Published ā¢ 36 -
Improving Text Embeddings with Large Language Models
Paper ā¢ 2401.00368 ā¢ Published ā¢ 79 -
Chain-of-Thought Reasoning Without Prompting
Paper ā¢ 2402.10200 ā¢ Published ā¢ 99 -
FiT: Flexible Vision Transformer for Diffusion Model
Paper ā¢ 2402.12376 ā¢ Published ā¢ 48
-
Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling
Paper ā¢ 2401.15977 ā¢ Published ā¢ 36 -
Lumiere: A Space-Time Diffusion Model for Video Generation
Paper ā¢ 2401.12945 ā¢ Published ā¢ 86 -
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Paper ā¢ 2307.04725 ā¢ Published ā¢ 64 -
Boximator: Generating Rich and Controllable Motions for Video Synthesis
Paper ā¢ 2402.01566 ā¢ Published ā¢ 26
-
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Paper ā¢ 2401.01885 ā¢ Published ā¢ 27 -
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
Paper ā¢ 2401.15687 ā¢ Published ā¢ 21 -
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Paper ā¢ 2312.17172 ā¢ Published ā¢ 26 -
MouSi: Poly-Visual-Expert Vision-Language Models
Paper ā¢ 2401.17221 ā¢ Published ā¢ 7
-
Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon
Paper ā¢ 2401.03462 ā¢ Published ā¢ 26 -
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers
Paper ā¢ 2305.07185 ā¢ Published ā¢ 9 -
YaRN: Efficient Context Window Extension of Large Language Models
Paper ā¢ 2309.00071 ā¢ Published ā¢ 65 -
Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache
Paper ā¢ 2401.02669 ā¢ Published ā¢ 14
-
Attention Is All You Need
Paper ā¢ 1706.03762 ā¢ Published ā¢ 44 -
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Paper ā¢ 2307.08691 ā¢ Published ā¢ 8 -
Mixtral of Experts
Paper ā¢ 2401.04088 ā¢ Published ā¢ 157 -
Mistral 7B
Paper ā¢ 2310.06825 ā¢ Published ā¢ 47