Ravi-Teja-konda / Surveillance_Video_SummarizerLinks
VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
☆114Updated 8 months ago
Alternatives and similar repositories for Surveillance_Video_Summarizer
Users that are interested in Surveillance_Video_Summarizer are comparing it to the libraries listed below
Sorting:
- ☆75Updated 2 weeks ago
- An extension of the previous 'Fitness-AI-Coach': a complete web application with real-time exercise recognition and counting. The exercis…☆87Updated this week
- AutoNL - Natural Language Automation tool☆85Updated last year
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆40Updated 4 months ago
- Using the moondream VLM with optical flow for promptable object tracking☆57Updated 3 months ago
- ☆77Updated 7 months ago
- ☆39Updated last year
- 100% Local Document deep search with LLMs☆26Updated 9 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 11 months ago
- ☆18Updated 3 weeks ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆35Updated last week
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆33Updated 5 months ago
- An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.☆26Updated 3 months ago
- This project implements a demonstrator agent that compares the Cache-Augmented Generation (CAG) Framework with traditional Retrieval-Augm…☆32Updated 5 months ago
- Roboflow Workflows on ComfyUI☆34Updated 8 months ago
- ☆71Updated 8 months ago
- Embed anything.☆28Updated last year
- A framework that uses multi-agents to enable users to perform a systematic data science pipeline with just two inputs.☆42Updated 9 months ago
- A project that enables identification and classification of an intent of a message with dynamic labels☆39Updated 5 months ago
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆33Updated 6 months ago
- Multimodal AI agent with Llama 3.2: A Streamlit app that processes text, images, PDFs, and PPTs, integrating NIM microservices, Milvus, a…☆117Updated 8 months ago
- 🐮📢 The first AI voice assistant that interrupts *you*☆146Updated 9 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 8 months ago
- The next evolution of Agents☆48Updated last week
- Rag Chatbot React And Tyepscript base boilerplate☆33Updated last year
- Deploy Apollo HF space locally☆40Updated 5 months ago
- Personal project, Generative AI, Streamlit, Python☆52Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated last year
- Agentic RAG to help you build a startup🚀☆44Updated 2 months ago
- No longer maintained:Your personal ArXiv Curator☆39Updated 6 months ago