Daily arXiv - a Maxwell-Jia Collection

Maxwell-Jia 's Collections

Daily arXiv

updated Oct 9

PAS: Data-Efficient Plug-and-Play Prompt Augmentation System

Paper • 2407.06027 • Published Jul 8 • 8
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 128
Toto: Time Series Optimized Transformer for Observability

Paper • 2407.07874 • Published Jul 10 • 29
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers

Paper • 2407.09413 • Published Jul 12 • 9
Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 155
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces

Paper • 2407.11895 • Published Jul 16 • 7
Scaling Granite Code Models to 128K Context

Paper • 2407.13739 • Published Jul 18 • 19
Vision language models are blind

Paper • 2407.06581 • Published Jul 9 • 82
Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

Paper • 2407.16607 • Published Jul 23 • 21
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains

Paper • 2407.18961 • Published Jul 18 • 38
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25 • 30
SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1 • 107
Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 73
ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget

Paper • 2408.00103 • Published Jul 31 • 16
ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM

Paper • 2408.12076 • Published Aug 22 • 11
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 115
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction

Paper • 2409.17422 • Published Sep 25 • 24
Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case Study

Paper • 2409.17580 • Published Sep 26 • 7
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Paper • 2410.05080 • Published Oct 7 • 19