Ravi-Teja-konda / Surveillance_Video_Summarizer
VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
☆102Updated 5 months ago
Alternatives and similar repositories for Surveillance_Video_Summarizer:
Users that are interested in Surveillance_Video_Summarizer are comparing it to the libraries listed below
- ☆51Updated 3 months ago
- Embed anything.☆29Updated 8 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 8 months ago
- The PyVisionAI Official Repo☆60Updated this week
- 🐮📢 The first AI voice assistant that interrupts *you*☆138Updated 5 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆82Updated last month
- Testing the different LLM and RAG Tests while I learn along the way☆79Updated last month
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆40Updated last month
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 7 months ago
- Roboflow Workflows on ComfyUI☆32Updated 4 months ago
- Generate python documentation using LLMs☆61Updated 7 months ago
- ☆28Updated 11 months ago
- ☆70Updated 3 months ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆64Updated 3 months ago
- Multimodal AI agent with Llama 3.2: A Streamlit app that processes text, images, PDFs, and PPTs, integrating NIM microservices, Milvus, a…☆98Updated 4 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆69Updated 5 months ago
- AutoNL - Natural Language Automation tool☆85Updated 11 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆111Updated 3 months ago
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling☆28Updated 3 months ago
- ☆16Updated 2 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆61Updated 6 months ago
- ☆198Updated 8 months ago
- Turn a fresh Linux installation into a fully configured, sleek, and modern on device AI development system by running a single command.☆54Updated last month
- ☆29Updated 11 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆89Updated 3 weeks ago
- Chat Bot with LLM and Fact Reference. RAG(Retrieval Augmented Generation) and LangChain backed☆128Updated 9 months ago
- Simple program to manually caption your images (or any other file types) so you can use them for AI training☆37Updated last year