capradeepgujaran's picture
Update README.md
5a09cf2 verified
|
raw
history blame
1.01 kB
---
title: Mutimodal Video Chat RAG
emoji: 🦀
colorFrom: purple
colorTo: purple
sdk: gradio
sdk_version: 5.3.0
app_file: app.py
pinned: false
license: mit
short_description: This is a RAG system for chatting with videos
---
# Video Chat RAG
This is a Retrieval-Augmented Generation (RAG) system for chatting with videos. Upload a video and ask questions about its content!
## Features
- Video frame analysis using CLIP
- Natural language querying of video content
- Frame retrieval with relevance scores
- User-friendly interface
## Usage
1. Upload a video using the file upload button
2. Click "Process Video" and wait for processing to complete
3. Type your question in the text box
4. Click "Search" to find relevant frames
## Technical Details
The system uses:
- CLIP for visual-language understanding
- FAISS for efficient similarity search
- Gradio for the user interface
## Limitations
- Maximum video file size: 500MB
- Supported formats: MP4
- Processing time depends on video length