[CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
☆41Jul 9, 2024Updated last year
Alternatives and similar repositories for AdaTAD
Users that are interested in AdaTAD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper, Temporal Action Localization with Enhanced Instant Discriminability☆28Mar 25, 2024Updated last year
- OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.☆323Apr 29, 2025Updated 10 months ago
- ☆18May 6, 2024Updated last year
- UniMD: Towards Unifying Moment retrieval and temporal action Detection☆57Jul 5, 2024Updated last year
- [CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection☆18Oct 3, 2024Updated last year
- Code release for ActionFormer (ECCV 2022)☆547Apr 11, 2024Updated last year
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization☆61Dec 6, 2025Updated 3 months ago
- ☆22May 8, 2023Updated 2 years ago
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling☆210Dec 27, 2023Updated 2 years ago
- The suite of modeling video with Mamba☆293May 14, 2024Updated last year
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)☆35Apr 17, 2025Updated 11 months ago
- [ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling☆509Nov 18, 2025Updated 4 months ago
- The official implementation of 'Proposal-based Multiple Instance Learning for Weakly-supervised Temporal Action Localization' (CVPR 2023)☆42Jun 1, 2023Updated 2 years ago
- Official implementation for ECCV paper "Towards Open Set Video Anomaly Detection"☆16Feb 11, 2023Updated 3 years ago
- [CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate E…☆27Jun 26, 2024Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Sep 25, 2023Updated 2 years ago
- [CVPR 2025] Official implementation of the paper "DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for T…☆25Jul 1, 2025Updated 8 months ago
- Code for CVPR 2019 paper☆12Apr 26, 2019Updated 6 years ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆763Oct 8, 2024Updated last year
- ☆42Apr 7, 2024Updated last year
- Unifying Specialized Visual Encoders for Video Language Models☆25Nov 22, 2025Updated 4 months ago
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability☆106Nov 28, 2024Updated last year
- Official implementation of paper AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding☆88Apr 23, 2025Updated 11 months ago
- Unsupervised Activity Segmentation by Joint Representation Learning and Online Clustering (CVPR 2022)☆13Sep 22, 2023Updated 2 years ago
- [NeurIPS 2025] PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer☆28Oct 2, 2025Updated 5 months ago
- ☆11Oct 13, 2024Updated last year
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆97Jan 14, 2025Updated last year
- codes for Uncovering Hidden Challenges in Query-Based Video Moment Retrieval☆20Sep 7, 2020Updated 5 years ago
- Official Implementation of SnAG (CVPR 2024)☆57Apr 26, 2025Updated 10 months ago
- Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization(IJCAI-22)☆12Oct 11, 2022Updated 3 years ago
- [ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization☆12Oct 8, 2024Updated last year
- 🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)☆90Jul 2, 2024Updated last year
- [CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark☆55Mar 19, 2022Updated 4 years ago
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding☆2,223Dec 15, 2025Updated 3 months ago
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos☆34May 27, 2025Updated 9 months ago
- 🧠 VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning (ICLR 2026)☆311Feb 8, 2026Updated last month
- ☆34Jun 2, 2023Updated 2 years ago
- DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization☆18Sep 28, 2023Updated 2 years ago
- [ICCV 2023] Official PyTorch implementation of the paper "DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion"☆37Mar 30, 2023Updated 2 years ago