Ravi-Teja-konda / Surveillance_Video_SummarizerLinks
VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
โ125Updated 6 months ago
Alternatives and similar repositories for Surveillance_Video_Summarizer
Users that are interested in Surveillance_Video_Summarizer are comparing it to the libraries listed below
Sorting:
- ๐ฎ๐ข The first AI voice assistant that interrupts *you*โ148Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioโฆโ84Updated last year
- โ107Updated last month
- Using the moondream VLM with optical flow for promptable object trackingโ71Updated 9 months ago
- Embed anything.โ27Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorchโ103Updated 11 months ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluationโ108Updated 11 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) ๐๏ธโ88Updated 2 years ago
- Inference and fine-tuning examples for vision models from ๐ค Transformersโ163Updated 4 months ago
- Structured Data Extractor for AI Agents. Search your documents or the web for specific data and get it back in JSON or Markdown in a singโฆโ180Updated 8 months ago
- โ207Updated last year
- Tiny client for LLMs with vision and tool calling. As simple as it gets.โ88Updated 11 months ago
- Jockey is a conversational video agent.โ93Updated 6 months ago
- Gradio based tool to run opensource LLM models directly from Huggingfaceโ96Updated last year
- Structured pruning and bias visualization for Large Language Models. Tools for LLM optimization and fairness analysis.โ25Updated last week
- An extension of the previous 'Fitness-AI-Coach': a complete web application with real-time exercise recognition and counting. The exercisโฆโ118Updated 5 months ago
- Notebooks using the Neural Magic libraries ๐โ39Updated last year
- โ84Updated last year
- โ101Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.โ22Updated last year
- โ40Updated last year
- No longer maintained:Your personal ArXiv Curatorโ41Updated last year
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Understโฆโ23Updated 8 months ago
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.reโฆโ46Updated last year
- โ19Updated 6 months ago
- Take your LLM to the optometrist.โ42Updated 2 weeks ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDBโ123Updated last year
- An open-source Discord bot, created using LlamaIndex, that - Listens to your server conversations, continuously learns from them & answeโฆโ76Updated last year
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findingsโ34Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask APIโ71Updated last year