This model contains the weights of NExT-GPT covering text-image-video-audio (tiva), which is built upon

1. Vicuna-7B with version 0
1. ImageBind
1. Stable Diffusion with version v1-5.
1. AudioLDM with version l-full.
1. ZeroScope with version v2_576w.

For more details about the usage of the model, please refer to our code repository.

Downloads last month: 404

Inference Examples

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

ChocoWu
/

nextgpt_7b_tiva_v0

Spaces using ChocoWu/nextgpt_7b_tiva_v0 3