Ravi-Teja-konda / Surveillance_Video_Summarizer
VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
☆98Updated 4 months ago
Alternatives and similar repositories for Surveillance_Video_Summarizer:
Users that are interested in Surveillance_Video_Summarizer are comparing it to the libraries listed below
- ☆51Updated 2 months ago
- Turn a fresh Linux installation into a fully configured, sleek, and modern on device AI development system by running a single command.☆44Updated this week
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆79Updated 7 months ago
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re…☆46Updated 2 months ago
- Embed anything.☆28Updated 7 months ago
- Testing the different LLM and RAG Tests while I learn along the way☆72Updated this week
- Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm…☆131Updated last month
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826☆55Updated 3 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆90Updated 6 months ago
- ☆29Updated last month
- This project involves using llamaindex Multi Agents concierge system and Qdrant vector database to customize the RAG application with use…☆45Updated 4 months ago
- Roboflow Workflows on ComfyUI☆32Updated 3 months ago
- ☆66Updated 2 months ago
- Jockey is a conversational video agent.☆56Updated last week
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆39Updated this week
- Serving LLMs in the HF-Transformers format via a PyFlask API☆68Updated 4 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆81Updated 3 weeks ago
- Routing on Random Forest (RoRF)☆98Updated 3 months ago
- Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interf…☆33Updated 2 months ago
- ☆195Updated 7 months ago
- Own your AI, search the web with it🌐😎☆74Updated this week
- ☆84Updated 3 weeks ago
- The PyVisionAI Official Repo☆54Updated this week
- A fork of OpenAI Swarm that supports Groq and Anthropic☆101Updated last month
- A toolkit for building multimodal AI agents☆133Updated this week
- 100% Local Document deep search with LLMs☆25Updated 4 months ago
- Context-aware structured outputs. Search your documents or the web for specific data and get it back in JSON or Markdown.☆139Updated 2 weeks ago
- AutoNL - Natural Language Automation tool☆85Updated 10 months ago
- ☆28Updated 10 months ago
- ☆29Updated 10 months ago