Ravi-Teja-konda / Surveillance_Video_Summarizer
VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.
โ91Updated 2 months ago
Related projects โ
Alternatives and complementary repositories for Surveillance_Video_Summarizer
- โ56Updated 3 weeks ago
- ๐ฎ๐ข The first AI voice assistant that interrupts *you*โ130Updated 2 months ago
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.โ78Updated last month
- Gradio based tool to run opensource LLM models directly from Huggingfaceโ87Updated 4 months ago
- Embed anything.โ29Updated 5 months ago
- โ56Updated last month
- 100% Local Document deep search with LLMsโ25Updated 2 months ago
- A toolkit for building multimodal AI agentsโ111Updated this week
- An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intelligโฆโ43Updated 3 months ago
- โ25Updated last month
- This project involves using llamaindex Multi Agents concierge system and Qdrant vector database to customize the RAG application with useโฆโ43Updated 3 months ago
- โ112Updated this week
- Routing on Random Forest (RoRF)โ84Updated last month
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)โ64Updated this week
- โ98Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioโฆโ77Updated 5 months ago
- Context-aware structured outputs. Search your documents or the web for specific data and get it back in JSON or Markdown.โ119Updated this week
- A repository Payman + Langgraph integration examples that allow AI Agent to simply create tasks for Humans on Payman that pay them money โฆโ65Updated last month
- ๐ค Headless IDE for AI agentsโ133Updated this week
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826โ52Updated last month
- 2024 LlamaIndex RAG Hackathon "1st Place Award" Projectโ65Updated 9 months ago
- A fork of OpenAI Swarm that supports Groq and Anthropicโ85Updated last month
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorchโ49Updated this week
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the โฆโ34Updated 3 weeks ago
- โ28Updated 8 months ago
- Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timmโฆโ119Updated this week
- Open-source Python toolkit focused on deep learning with ordinal methodologiesโ31Updated 3 weeks ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It proโฆโ62Updated 2 weeks ago
- AutoNL - Natural Language Automation toolโ83Updated 8 months ago
- A MULTI-GENERATOR ENSEMBLE FRAMEWORK FOR NATURAL LANGUAGE TO SQLโ46Updated last week