TMT's picture

TMT

JJ-TMT

·

AI & ML interests

None yet

Organizations

None yet

JJ-TMT's activity

upvoted 2 papers 21 days ago

Teach Multimodal LLMs to Comprehend Electrocardiographic Images

Paper • 2410.19008 • Published 28 days ago • 22

Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning

Paper • 2410.19290 • Published 25 days ago • 10

upvoted a paper 25 days ago

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published 28 days ago • 58

upvoted a paper 26 days ago

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Paper • 2410.08815 • Published Oct 11 • 41

upvoted 4 papers 27 days ago

Does Spatial Cognition Emerge in Frontier Models?

Paper • 2410.06468 • Published Oct 9 • 2

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8 • 107

Benchmarking Agentic Workflow Generation

Paper • 2410.07869 • Published Oct 10 • 25

Baichuan Alignment Technical Report

Paper • 2410.14940 • Published about 1 month ago • 48

upvoted a paper 2 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17 • 71

upvoted a collection 2 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 352

upvoted a paper 2 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4 • 72

upvoted 2 papers 3 months ago

BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline

Paper • 2408.15079 • Published Aug 27 • 52

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22 • 50

upvoted a collection 3 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 198

upvoted 2 papers 3 months ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP

Paper • 2408.04303 • Published Aug 8 • 9

upvoted 3 papers 4 months ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 128

Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion

Paper • 2407.13759 • Published Jul 18 • 17

Map It Anywhere (MIA): Empowering Bird's Eye View Mapping using Large-scale Public Data

Paper • 2407.08726 • Published Jul 11 • 8

upvoted a paper 5 months ago

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

Paper • 2406.16860 • Published Jun 24 • 57