HanZiwww
/

Aeriel

Model card Files Files and versions Community

Aeriel / README.md

HanZiwww's picture

Update README.md

ad4a5de verified 9 months ago

|

3.25 kB

	---
	license: apache-2.0
	tags:
	- sdxl
	- LoRA
	- GPT-SoVITS
	- VITS
	---
	# Aerial

	![](https://image.marswh.top/121323124.png)

	<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/655a156d9d249b4ab35f85db/QuHntOwOHTyeO9I5hFz2p.wav"></audio>

	Welcome to the repository for the Aerial character generation project, a specialized suite that includes implementations of the LoRA model adapted from LyCORIS and the GPT-SoVITS process, designed to generate images and voices of a custom character named Aerial using the Stable Diffusion XL model framework and the VITS voice synthesis model.

	## Overview

	This project aims to expand the capabilities of the Stable Diffusion XL model and VITS model by incorporating tailored models for generating high-quality images and voices of Aerial, a character born from creativity and detailed characterization. By leveraging both the LoRA (Locally Reweighted Attention) mechanism and GPT-SoVITS process, we fine-tune the models on specific datasets that include diverse illustrations, descriptions, and voice samples of Aerial, ensuring that the generated images and voices stay true to the original character's essence.

	## Features

	- Custom Character Generation: Generate unique and high-quality images and voices of Aerial, adhering to the character's established attributes and themes.
	- Fine-Tuned Attention Mechanism: Utilizes the LoRA model, adapted from LyCORIS, for precise control over the character's features in generated images.
	- Advanced Voice Synthesis: Incorporates a custom-trained VITS model using the GPT-SoVITS process for generating lifelike voice samples of Aerial.
	- Creative Freedom: Designed for artists, writers, voice actors, and creators looking to bring their vision of Aerial to life through AI-generated imagery and voice.
	- Seamless Integration: Compatible with both the Stable Diffusion XL and VITS model, allowing for easy integration into existing workflows for comprehensive character creation.

	## Installation & Usage

	### WebUI for Image Generation

	For generating images of Aerial, the project is accessible through a [WebUI](https://github.com/AUTOMATIC1111/stable-diffusion-webui) interface.

	### VITS Model for Voice Generation

	We recommend an api surface, [vits-simple-api](https://github.com/Artrajz/vits-simple-api), to use the GPT-SoVITS model.

	## Contributing

	Contributions to the Aerial character generation project are welcome! If you have suggestions for improvements, new features, or want to contribute data for training, please open an issue or submit a pull request.

	## License

	This project is licensed under the Apache License 2.0 - see the [LICENSE](https://huggingface.co/models?license=license%3Aapache-2.0) file for details.

	## Acknowledgments

	- The LyCORIS team for their pioneering work on the LoRA model.
	- The creators of the GPT-SoVITS process for advancing voice synthesis technology.
	- The Stable Diffusion and VITS model creators for providing robust foundations for AI-generated art and voice.

	## Disclaimer

	This project is for educational and artistic purposes only. Please ensure that all generated content adheres to applicable laws and respects the intellectual property rights of others.