Ravi-Teja-konda/Surveillance_Video_Summarizer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Ravi-Teja-konda/Surveillance_Video_Summarizer)

Ravi-Teja-konda / Surveillance_Video_Summarizer

VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.

☆135

Alternatives and similar repositories for Surveillance_Video_Summarizer

Users that are interested in Surveillance_Video_Summarizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aminK8 / KnobGen
View on GitHub
CVPR 2025 Workshop on CVEU.
☆42Jun 12, 2025Updated last year
UKPLab / 5pils
View on GitHub
Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…
☆45Dec 6, 2025Updated 7 months ago
ryota-komatsu / speaker_disentangled_hubert
View on GitHub
Official repository of the IEEE OJSP paper "Speaker-Disentangled Chunk-Wise Regression for Syllabic Tokenization"
☆46Updated this week
JHW5981 / AceParse
View on GitHub
AceParse: A Comprehensive Dataset with Diverse Structured Texts for Academic Literature Parsing
☆44Sep 17, 2024Updated last year
chen37058 / Physical-Attacks-in-Embodied-Nav
View on GitHub
The official implementation for "Towards Physically Realizable Adversarial Attacks in Embodied Vision Navigation(IROS 2025)"
☆26Mar 3, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
PragmaticMachineLearning / docai
View on GitHub
Structured information extraction from documents
☆316May 3, 2026Updated 2 months ago
fb82 / MiHo
View on GitHub
Image matching with MOP+MiHo+NCC
☆64Jul 12, 2026Updated 2 weeks ago
uclaml / COPS
View on GitHub
The official implementation of Cross-Task Experience Sharing (COPS)
☆29Oct 23, 2024Updated last year
hahamyt / clickattention
View on GitHub
ClickAttention: Click Region Similarity Guided Interactive Segmentation
☆23Jan 3, 2025Updated last year
moucheng2017 / SOP-LVM-ICL-Ensemble
View on GitHub
[NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…
☆23Mar 16, 2025Updated last year
orionw / promptriever
View on GitHub
The first dense retrieval model that can be prompted like an LM
☆93May 8, 2025Updated last year
stickystyle / ZorkGPT
View on GitHub
Teaching AI to play the classic text adventure Zork using Large Language Models
☆37Apr 5, 2026Updated 3 months ago
KhoomeiK / interrupting-cow
View on GitHub
🐮📢 The first AI voice assistant that interrupts *you*
☆148Sep 6, 2024Updated last year
LAGoM-NLP / transtokenizer
View on GitHub
☆57Dec 27, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ConfeitoHS / arcle
View on GitHub
A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)
☆73Aug 30, 2024Updated last year
BlueDyee / TF-GPH
View on GitHub
(AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing
☆61Dec 17, 2024Updated last year
calmstate / Itinerant
View on GitHub
A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.
☆19Aug 30, 2024Updated last year
GenRobo / MatMamba
View on GitHub
Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"
☆64Nov 21, 2024Updated last year
quentin-r37 / sortify-ai
View on GitHub
☆57Feb 18, 2025Updated last year
Hao840 / ADEM-VL
View on GitHub
PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"
☆21Oct 28, 2024Updated last year
znfgnu / easy-agent
View on GitHub
Simple agent framework using Ollama tool calling
☆10Aug 27, 2024Updated last year
HKUDS / XRec
View on GitHub
[EMNLP'2024] "XRec: Large Language Models for Explainable Recommendation"
☆170Sep 24, 2024Updated last year
Beckschen / LLaVolta
View on GitHub
[NeurIPS 2024] Efficient Large Multi-modal Models via Visual Context Compression
☆66Feb 19, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
tercumantanumut / GameCompanionAI
View on GitHub
Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…
☆55Sep 30, 2024Updated last year
fajrmn / kokoro-on-browser
View on GitHub
☆16Feb 1, 2025Updated last year
sshh12 / llm-chat-web-ui
View on GitHub
LLM Chat is an open-source serverless alternative to ChatGPT.
☆36Sep 13, 2024Updated last year
Cerebras / DocChat
View on GitHub
GPT-4 Level Conversational QA Trained In a Few Hours
☆69Aug 21, 2024Updated last year
aiser-team / cabrnet
View on GitHub
CaBRNet - Case-Based Reasoning Networks made simple
☆22Jul 22, 2026Updated last week
Fraunhofer-IIS / ODAQ
View on GitHub
A collection of audio signals accompanied by corresponding subjective scores of perceived quality. Everything under permissive licenses.
☆53Feb 24, 2026Updated 5 months ago
catena-labs / moa-llm
View on GitHub
A Python library to orchestrate LLMs in a neural network-inspired structure
☆54Oct 4, 2024Updated last year
QuixiAI / dolphin-logger
View on GitHub
☆107Nov 1, 2025Updated 8 months ago
hyperfocAIs / Attend
View on GitHub
Attend - to what matters.
☆17Feb 22, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
thad0ctor / KrunchWrapper
View on GitHub
☆18Jul 1, 2025Updated last year
LuminosityX / MM-Forecast
View on GitHub
Implementation of our paper, "MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models".
☆18Apr 16, 2025Updated last year
jkallini / mrt5
View on GitHub
Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."
☆59Sep 25, 2025Updated 10 months ago
fanqiwan / FuseAI
View on GitHub
FuseAI Project
☆600Jan 25, 2025Updated last year
nttmdlab-nlp / InstructDoc
View on GitHub
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)
☆162May 31, 2024Updated 2 years ago
jxiw / MambaInLlama
View on GitHub
[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
☆243Oct 14, 2025Updated 9 months ago
boneylizard / Eloquent
View on GitHub
The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…
☆64Updated this week