Corvius

Corvius's activity

upvoted a paper 4 months ago

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Paper • 2309.03883 • Published Sep 7, 2023 • 33

upvoted a collection 4 months ago

Gemma 2 Release

15 items • Updated Sep 9 • 193

upvoted 2 collections 5 months ago

DeepSeekCoder-V2

6 items • Updated Sep 5 • 83

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 10 days ago • 157

upvoted a collection 7 months ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Sep 25 • 682

upvoted a paper 8 months ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 602

upvoted a paper 12 months ago

Exponentially Faster Language Modelling

Paper • 2311.10770 • Published Nov 15, 2023 • 118

upvoted 4 papers over 1 year ago

Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning

Paper • 2307.02053 • Published Jul 5, 2023 • 23

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 80

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Paper • 2307.01952 • Published Jul 4, 2023 • 82

Textbooks Are All You Need

Paper • 2306.11644 • Published Jun 20, 2023 • 142