charlescai2016 (Charles Cai)

upvoted a paper about 2 hours ago

BOND: Aligning LLMs with Best-of-N Distillation

Paper • 2407.14622 • Published Jul 19 • 18

upvoted an article 8 days ago

Article

How to directly access 150k+ Hugging Face Datasets with DuckDB and query using GPT-4o

By

•

May 31

• 11

upvoted a paper about 2 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25 • 100

upvoted an article about 2 months ago

Article

Preference Optimization for Vision Language Models

Jul 10

• 41

upvoted a paper about 2 months ago

Let's Verify Step by Step

Paper • 2305.20050 • Published May 31, 2023 • 9

upvoted a paper 3 months ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12 • 61

upvoted an article 3 months ago

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Jul 30

• 59

upvoted 2 papers 3 months ago

GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS

Paper • 2408.01584 • Published Aug 2 • 7

Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining

Paper • 2408.02657 • Published Aug 5 • 32

upvoted a paper 4 months ago

Adding Conditional Control to Text-to-Image Diffusion Models

Paper • 2302.05543 • Published Feb 10, 2023 • 40

upvoted 2 articles 4 months ago

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Jul 18

• 48

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22

• 226

upvoted a collection 4 months ago

FP8 LLMs for vLLM

Collection

Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated 27 days ago • 58

upvoted 2 papers 4 months ago

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Paper • 2407.01370 • Published Jul 1 • 85

Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

Paper • 2407.00402 • Published Jun 29 • 22

upvoted a collection 5 months ago

Graph-enhanced RAG

Collection

using knowledge graphs in RAG for grounding LLM results • 24 items • Updated 22 days ago • 12

upvoted 3 papers 5 months ago

upvoted a paper 6 months ago

Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts

Paper • 2405.11273 • Published May 18 • 17

Charles Cai

AI & ML interests

Organizations

charlescai2016's activity

BOND: Aligning LLMs with Best-of-N Distillation

How to directly access 150k+ Hugging Face Datasets with DuckDB and query using GPT-4o

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Preference Optimization for Vision Language Models

Let's Verify Step by Step

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Memory-efficient Diffusion Transformers with Quanto and Diffusers

GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS

Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining

Adding Conditional Control to Text-to-Image Diffusion Models

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Fine-tune Llama 3 with ORPO

FP8 LLMs for vLLM

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

Graph-enhanced RAG

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges

4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models

Teaching Large Language Models to Reason with Reinforcement Learning

Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts