A newer version of the Gradio SDK is available:
5.6.0
metadata
title: Mutimodal Video Chat RAG
emoji: 🦀
colorFrom: purple
colorTo: purple
sdk: gradio
sdk_version: 5.3.0
app_file: app.py
pinned: false
license: mit
short_description: This is a RAG system for chatting with videos
Video Chat RAG
This is a Retrieval-Augmented Generation (RAG) system for chatting with videos. Upload a video and ask questions about its content!
Features
- Video frame analysis using CLIP
- Natural language querying of video content
- Frame retrieval with relevance scores
- User-friendly interface
Usage
- Upload a video using the file upload button
- Click "Process Video" and wait for processing to complete
- Type your question in the text box
- Click "Search" to find relevant frames
Technical Details
The system uses:
- CLIP for visual-language understanding
- FAISS for efficient similarity search
- Gradio for the user interface
Limitations
- Maximum video file size: 500MB
- Supported formats: MP4
- Processing time depends on video length