SDXL 1.0 finetunes on vucinatim/spectrogram-captions for 89 epochs(800 steps). It generates spectrograms for simple sounds. It currently does not produce very good sound effects, but I will train the model for longer in the future.
- Downloads last month
- 8
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.