Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation Paper • 2410.05363 • Published about 1 month ago • 44
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models Paper • 2408.02718 • Published Aug 5 • 60
Efficient Failure Pattern Identification of Predictive Algorithms Paper • 2306.00760 • Published Jun 1, 2023 • 1
Bellman Optimal Step-size Straightening of Flow-Matching Models Paper • 2312.16414 • Published Dec 27, 2023 • 1