ponyxl / README.md
nroggendorff's picture
Update README.md
f235b1a verified
|
raw
history blame
1.22 kB
metadata
license: mit

Pony Diffusion XL Model Card

Pony Diffusion XL is a latent text-to-image diffusion model capable of generating images of Horses, mainly, and other things. For more information about how Stable Diffusion functions, please have a look at 🤗's Stable Diffusion blog.

You can use this with the 🧨Diffusers library from Hugging Face.

So pretty, right?

Diffusers

from diffusers import StableDiffusionPipeline
import torch

pipeline = StableDiffusionPipeline.from_pretrained("nroggendorff/ponyxl").to("cuda")

image = pipeline(prompt="a chibi doll").images[0]
image.save("horse.png")

Model Details

  • train_batch_size: 1
  • gradient_accumulation_steps: 4
  • learning_rate: 1e-2
  • lr_warmup_steps: 500
  • mixed_precision: "fp16"
  • eval_metric: "mean_squared_error"

Limitations

  • The model does not achieve perfect photorealism
  • The model cannot render legible text

Developed by

  • Noa Linden Roggendorff

This model card was written by Noa Roggendorff and is based on the Stable Diffusion v1-5 Model Card.