Ravi-Teja-konda / Surveillance_Video_Summarizer
VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
โ105Updated 6 months ago
Alternatives and similar repositories for Surveillance_Video_Summarizer:
Users that are interested in Surveillance_Video_Summarizer are comparing it to the libraries listed below
- ๐ฎ๐ข The first AI voice assistant that interrupts *you*โ140Updated 6 months ago
- Embed anything.โ29Updated 10 months ago
- โ51Updated 4 months ago
- Gradio based tool to run opensource LLM models directly from Huggingfaceโ91Updated 8 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioโฆโ80Updated 9 months ago
- โ201Updated 9 months ago
- Jockey is a conversational video agent.โ74Updated last month
- An extension of the previous 'Fitness-AI-Coach': a complete web application with real-time exercise recognition and counting. The exercisโฆโ66Updated 2 months ago
- A project that enables identification and classification of an intent of a message with dynamic labelsโ36Updated 3 months ago
- โ68Updated 5 months ago
- Inference and fine-tuning examples for vision models from ๐ค Transformersโ70Updated this week
- โ99Updated 6 months ago
- Roboflow Workflows on ComfyUIโ32Updated 6 months ago
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Understโฆโ22Updated last week
- Serving LLMs in the HF-Transformers format via a PyFlask APIโ71Updated 6 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorchโ86Updated 3 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.โ22Updated 6 months ago
- This project involves using llamaindex Multi Agents concierge system and Qdrant vector database to customize the RAG application with useโฆโ48Updated 7 months ago
- Data Processor for AI Agents. Search your documents or the web for specific data and get it back in JSON or Markdown in a single tool caโฆโ159Updated 2 weeks ago
- Personal project, Generative AI, Streamlit, Pythonโ51Updated last month
- No longer maintained:Your personal ArXiv Curatorโ38Updated 4 months ago
- AutoNL - Natural Language Automation toolโ85Updated last year
- 100% Local Document deep search with LLMsโ26Updated 6 months ago
- Testing and evaluation framework for voice agentsโ98Updated last month
- Rag Chatbot React And Tyepscript base boilerplateโ33Updated 11 months ago
- The PyVisionAI Official Repoโ99Updated 2 weeks ago
- Redact PDF/image-based documents, or CSV/XLSX files using a Gradio-based GUI interfaceโ16Updated this week
- The agentic video editing frameworkโ101Updated last month
- โ36Updated last month
- Using the moondream VLM with optical flow for promptable object trackingโ51Updated last month