---
title: Multimodal Video Chat RAG
emoji: 🦀
colorFrom: purple
colorTo: purple
sdk: gradio
sdk_version: 5.3.0
app_file: app.py
pinned: false
license: mit
short_description: This is a RAG system for chatting with videos
---

# Video Chat RAG

This is a Retrieval-Augmented Generation (RAG) system for chatting with videos. Upload a video and ask questions about its content!

## Features

- Video frame analysis using CLIP
- Natural language querying of video content
- Frame retrieval with relevance scores
- User-friendly interface

## Usage

1. Upload a video using the file upload button
2. Click "Process Video" and wait for processing to complete
3. Type your question in the text box
4. Click "Search" to find relevant frames

## Technical Details

The system uses:

- CLIP for visual-language understanding
- FAISS for efficient similarity search
- Gradio for the user interface

## Limitations

- Maximum video file size: 500MB
- Supported formats: MP4
- Processing time depends on video length
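
The size and format limits above could be checked before a video is processed. The following is a minimal sketch, not the app's actual code: `validate_upload`, `MAX_SIZE_BYTES`, and `ALLOWED_EXTENSIONS` are hypothetical names, and "500MB" is assumed to mean 500 × 1024 × 1024 bytes.

```python
import os

# Assumption: "500MB" is interpreted as 500 * 1024 * 1024 bytes.
MAX_SIZE_BYTES = 500 * 1024 * 1024
# Only MP4 is listed as a supported format.
ALLOWED_EXTENSIONS = {".mp4"}


def validate_upload(path: str) -> None:
    """Hypothetical helper: raise ValueError if the file breaks a documented limit."""
    # Compare the extension case-insensitively so "clip.MP4" is accepted.
    ext = os.path.splitext(path)[1].lower()
    if ext not in ALLOWED_EXTENSIONS:
        raise ValueError(f"Unsupported format {ext or '(none)'}; only MP4 is supported")

    # Reject files larger than the documented maximum.
    size = os.path.getsize(path)
    if size > MAX_SIZE_BYTES:
        raise ValueError(f"File is {size} bytes; the maximum is {MAX_SIZE_BYTES} bytes")
```

Calling `validate_upload(path)` at the start of processing would let the interface report a clear error instead of failing partway through. Note that this only inspects the file name and size; it does not verify that the file is a valid MP4 container.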