Last Week in Medical AI: Top Research Papers/Models π (September 21 - September 27, 2024) Sep 28 β’ 2
Performance Comparison: Llama-3.2 vs. Llama-3.1 LLMs and Smaller Models (3B, 1B) in Medical and Healthcare AI Domains π©Ίπ§¬π Sep 26 β’ 5
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper β’ 2408.08872 β’ Published Aug 16 β’ 97
TraDiffusion: Trajectory-Based Training-Free Image Generation Paper β’ 2408.09739 β’ Published Aug 19 β’ 7
Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges Paper β’ 2408.08946 β’ Published Aug 16 β’ 10
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data Paper β’ 2408.10119 β’ Published Aug 19 β’ 15
SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views Paper β’ 2408.10195 β’ Published Aug 19 β’ 12
MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model Paper β’ 2408.10198 β’ Published Aug 19 β’ 32
MambaEVT: Event Stream based Visual Object Tracking using State Space Model Paper β’ 2408.10487 β’ Published Aug 20 β’ 5
Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model Paper β’ 2408.10764 β’ Published Aug 20 β’ 7
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding Paper β’ 2408.11049 β’ Published Aug 20 β’ 11
NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency Paper β’ 2408.11054 β’ Published Aug 20 β’ 10
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning Paper β’ 2408.11001 β’ Published Aug 20 β’ 11
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper β’ 2408.11039 β’ Published Aug 20 β’ 56
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering Paper β’ 2408.09174 β’ Published Aug 17 β’ 51
Out-of-Distribution Detection with Attention Head Masking for Multimodal Document Classification Paper β’ 2408.11237 β’ Published Aug 20 β’ 4
Backward-Compatible Aligned Representations via an Orthogonal Transformation Layer Paper β’ 2408.08793 β’ Published Aug 16 β’ 4
FRAP: Faithful and Realistic Text-to-Image Generation with Adaptive Prompt Weighting Paper β’ 2408.11706 β’ Published Aug 21 β’ 5
TrackGo: A Flexible and Efficient Method for Controllable Video Generation Paper β’ 2408.11475 β’ Published Aug 21 β’ 16
GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models Paper β’ 2408.11817 β’ Published Aug 21 β’ 7