capradeepgujaran's picture
Update README.md
5a09cf2 verified
|
raw
history blame
1.01 kB
metadata
title: Mutimodal Video Chat RAG
emoji: 🦀
colorFrom: purple
colorTo: purple
sdk: gradio
sdk_version: 5.3.0
app_file: app.py
pinned: false
license: mit
short_description: This is a RAG system for chatting with videos

Video Chat RAG

This is a Retrieval-Augmented Generation (RAG) system for chatting with videos. Upload a video and ask questions about its content!

Features

  • Video frame analysis using CLIP
  • Natural language querying of video content
  • Frame retrieval with relevance scores
  • User-friendly interface

Usage

  1. Upload a video using the file upload button
  2. Click "Process Video" and wait for processing to complete
  3. Type your question in the text box
  4. Click "Search" to find relevant frames

Technical Details

The system uses:

  • CLIP for visual-language understanding
  • FAISS for efficient similarity search
  • Gradio for the user interface

Limitations

  • Maximum video file size: 500MB
  • Supported formats: MP4
  • Processing time depends on video length