Ravi-Teja-konda / Surveillance_Video_Summarizer

VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
โ˜†91Updated 2 months ago

Related projects โ“˜

Alternatives and complementary repositories for Surveillance_Video_Summarizer