arxiv:2408.07009

Imagen 3

Published on Aug 13

· Submitted by

akhaliq on Aug 14

#2 Paper of the day

Upvote

Authors:

Jason Baldridge ,

Kelvin Chan ,

Yichang Chen ,

Yuqing Du ,

Zach Eaton-Rosen ,

Hongliang Fei ,

Alex Haig ,

Hexiang Hu ,

Tobenna Peter Igwe

Abstract

We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models.

View arXiv page View PDF Add to collection

Community

akhaliq

Paper submitter Aug 14

alfredplpl

Aug 14

I cannot find any implementation details. I think Imagen 3 is harmful without control. However, I’m sad not to release the details as an engineer.

MonsterMMORPG

Aug 14

Google is as always lying and showing off

Unless they publish weights and we can try I say SOTA is FLUX

Here evidence

https://youtu.be/bupRePUOA18?si=abhlVZ-COMp_TMan

iperov

Aug 14

no one has seen imagen1 and imagen2. They are just pictures in pdf

MonsterMMORPG

Aug 14

so true. 0 reason to believe google at this point. hack i dont believe any of their AI claims. so far Gemini was only a joke for me

mkaichristensen

Aug 14

Total nothingburger of a paper. Bunch of unreplicatable user studies claiming everyone likes Imagen 3 better than other models, and a lot of waffling about safety and fairness. No implementation details, no info about the dataset or training, nothing. Basically just a PR stunt.