Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos
☆13Jun 26, 2023Updated 2 years ago
Alternatives and similar repositories for Captionomaly-Deep-Learning-Toolbox-for-Anomaly-Captioning
Users that are interested in Captionomaly-Deep-Learning-Toolbox-for-Anomaly-Captioning are comparing it to the libraries listed below
Sorting:
- An automated anomaly detection system on realtime CCTV videos using Deep Learning.☆20Jun 19, 2023Updated 2 years ago
- Official implementation for paper TEVAD: Improved video anomaly detection with captions☆38Apr 5, 2023Updated 2 years ago
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning☆36Aug 8, 2024Updated last year
- ☆34Oct 22, 2025Updated 4 months ago
- Code for "Taxonomy Adaptive Cross-Domain Adaptation in Medical Imaging via Optimization Trajectory Distillation", ICCV 2023☆16Aug 31, 2023Updated 2 years ago
- Official repository of "TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection"☆11May 25, 2025Updated 9 months ago
- Demo code for CVPR 2016 paper: Learning a Discriminative Null Space for Person Re-identification☆13May 23, 2018Updated 7 years ago
- Cascade Pose Regression by global tunning☆12May 22, 2015Updated 10 years ago
- ☆11Jan 4, 2023Updated 3 years ago
- Official implementation for “SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain”☆21Dec 11, 2025Updated 2 months ago
- A package for Hangul (korean alphabet)☆13Dec 19, 2022Updated 3 years ago
- Just a simple program that uses the ElevenLabs text to speech AI. All it does is take a string and output the audio.☆10Feb 19, 2023Updated 3 years ago
- Extract text data from documents using OCR (optical character recognition) technology and NER (named entity recognition).☆10May 11, 2023Updated 2 years ago
- ☆18Jun 25, 2023Updated 2 years ago
- [ICIP2021] The official PyTorch implementation of MASK GUIDED ATTENTION FOR FINE-GRAINED PATCHY IMAGE CLASSIFICATION☆11Nov 3, 2021Updated 4 years ago
- Code to reproduce 'MOCCA: Multi-Layer One-Class Classification for Anomaly Detection'☆10Dec 12, 2021Updated 4 years ago
- Code release for Grad-CAM Guided Attention Module for Fine-grained Visual Classification (MLSP 2022)☆13Aug 25, 2021Updated 4 years ago
- Image Action Recognition system that uses the Stanford40 dataset☆11Aug 15, 2023Updated 2 years ago
- ☆10Mar 30, 2022Updated 3 years ago
- ☆51Jun 19, 2024Updated last year
- an improvement of the paper: Learning to Detect Violent Videos using Convolution LSTM☆11Jun 1, 2020Updated 5 years ago
- Code for the CubeRefine R-CNN model of our CVPRW '23 paper "Parcel3D: Shape Reconstruction From Single RGB Images for Applications in Tra…☆16Jul 12, 2023Updated 2 years ago
- ☆15Jul 9, 2019Updated 6 years ago
- code for "GLEN: General-Purpose Event Detection for Thousands of Types"☆13Nov 6, 2023Updated 2 years ago
- One-Shot Face Recognition Using Siamese Neural Networks☆14Mar 15, 2020Updated 5 years ago
- ☆13Aug 27, 2021Updated 4 years ago
- ☆10Mar 26, 2024Updated last year
- Korean ASR using PyTorch / Listen, Attend and Spell (LAS) / Seq2seq with Attention / Naver-A.I-Hackathon-Speech / A.I Hub Dataset / 한국…☆12Feb 10, 2020Updated 6 years ago
- SODA: Story Oriented Dense Video Captioning Evaluation Framework☆13May 3, 2024Updated last year
- Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue☆14Oct 12, 2021Updated 4 years ago
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13May 13, 2023Updated 2 years ago
- A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel…☆15Dec 10, 2022Updated 3 years ago
- Official project page of the paper "Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges" (Accep…☆71Apr 30, 2024Updated last year
- Skipping Recurrent Neural Networks☆13Aug 3, 2018Updated 7 years ago
- This project is an implementation of fine-tuning an SDXL model using DreamBooth and LoRA on custom data of interior rooms to generate des…☆11Feb 8, 2024Updated 2 years ago
- A tool for taking screenshots. It's usable for logging tests.☆25Dec 3, 2017Updated 8 years ago
- Official code for "FedVAD: Enhancing Federated Video Anomaly Detection with GPT-Driven Semantic Distillation"☆15Jul 13, 2024Updated last year
- ☆15May 13, 2024Updated last year
- Video summarization using Vision Transformers☆13Feb 5, 2023Updated 3 years ago