Ravi-Teja-konda / Surveillance_Video_Summarizer
VLM-driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
☆107 Updated 7 months ago
Alternatives and similar repositories for Surveillance_Video_Summarizer:
Users who are interested in Surveillance_Video_Summarizer are comparing it to the libraries listed below:
- ☆51 Updated 5 months ago
- Embed anything. ☆29 Updated 11 months ago
- Roboflow Workflows on ComfyUI ☆32 Updated 7 months ago
- 🐮📢 The first AI voice assistant that interrupts *you* ☆140 Updated 7 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆80 Updated 10 months ago
- Inference and fine-tuning examples for vision models from 🤗 Transformers ☆76 Updated last week
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆63 Updated 8 months ago
- ☆39 Updated last year
- ☆130 Updated last week
- ☆204 Updated 10 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆91 Updated 4 months ago
- A repo for generating educational presentation videos. ☆24 Updated this week
- ☆28 Updated last year
- Structured Data Extractor for AI Agents. Search your documents or the web for specific data and get it back in JSON or Markdown in a sing… ☆166 Updated 3 weeks ago
- Testing and evaluation framework for voice agents ☆110 Updated 2 months ago
- Gradio-based tool to run open-source LLM models directly from Hugging Face ☆90 Updated 9 months ago
- RAG chatbot boilerplate based on React and TypeScript ☆33 Updated last year
- ☆67 Updated 6 months ago
- Using the moondream VLM with optical flow for promptable object tracking ☆53 Updated 2 months ago
- ☆68 Updated 6 months ago
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst… ☆22 Updated last month
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu… ☆63 Updated 8 months ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos. ☆39 Updated 3 months ago
- Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart. ☆117 Updated 2 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆71 Updated 7 months ago
- Problem-Oriented Segmentation and Retrieval (EMNLP 2024 Findings) ☆31 Updated 5 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆105 Updated 2 weeks ago
- ☆29 Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 9 months ago
- Get information about a GitHub repository using the power of LLMs. ☆50 Updated last year