Adit31 / Captionomaly-Deep-Learning-Toolbox-for-Anomaly-Captioning
Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos
☆14Updated last year
Alternatives and similar repositories for Captionomaly-Deep-Learning-Toolbox-for-Anomaly-Captioning:
Users that are interested in Captionomaly-Deep-Learning-Toolbox-for-Anomaly-Captioning are comparing it to the libraries listed below
- ☆20Updated 2 years ago
- ☆29Updated 3 years ago
- Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)☆75Updated 3 years ago
- Pytorch implementation for "Progressive Video Summarization via Multimodal Self-supervised Learning"☆33Updated 2 years ago
- The official implementation of 'Align and Attend: Multimodal Summarization with Dual Contrastive Losses' (CVPR 2023)☆75Updated last year
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)☆14Updated 2 years ago
- Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022☆38Updated 2 years ago
- Fusional approaches for temporal action localization in untrimmed videos☆36Updated 2 years ago
- ☆44Updated last year
- The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".☆18Updated last year
- ☆33Updated last year
- [CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.☆52Updated 2 years ago
- Official implementation for paper TEVAD: Improved video anomaly detection with captions☆29Updated last year
- [CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》☆62Updated 2 years ago
- I3D feature extractor☆44Updated 5 years ago
- The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).☆13Updated 2 years ago
- Temporal Action Localization Visualization Tool (TALVT) is a Javascript based simple visualization tool to visualize the outcomes of the …☆28Updated 4 years ago
- [ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos☆120Updated last year
- Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"☆107Updated 3 years ago
- [CVPR 2025] Official implementation of "Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity"☆38Updated this week
- Source code of our CVPR2024 paper TeachCLIP for Text-to-Video Retrieval☆28Updated 3 weeks ago
- Video Graph Transformer for Video Question Answering (ECCV'22)☆47Updated last year
- Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.☆31Updated last year
- Official code for "Learning Prompt-Enhanced Context features for Weakly-Supervised Video Anomlay Detection" (IEEE-TIP)☆83Updated 7 months ago
- https://layer6ai-labs.github.io/xpool/☆122Updated last year
- A Pytorch implementation of "describing videos by exploiting temporal structure", ICCV 2015☆48Updated 2 years ago
- Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"☆51Updated 3 years ago
- Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).☆47Updated 2 years ago
- Weakly Supervised Video Moment Localisation with Contrastive Negative Sample Mining☆26Updated 2 years ago
- A Video-to-Text Framework☆10Updated last year