[CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
☆40Jul 9, 2024Updated last year
Alternatives and similar repositories for AdaTAD
Users that are interested in AdaTAD are comparing it to the libraries listed below
Sorting:
- OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.☆319Apr 29, 2025Updated 10 months ago
- [CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection☆18Oct 3, 2024Updated last year
- A guide to structured generation using constrained decoding☆14Jun 9, 2024Updated last year
- Code release for ActionFormer (ECCV 2022)☆542Apr 11, 2024Updated last year
- UniMD: Towards Unifying Moment retrieval and temporal action Detection☆55Jul 5, 2024Updated last year
- The suite of modeling video with Mamba☆290May 14, 2024Updated last year
- [NeurIPS 2025] PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer☆28Oct 2, 2025Updated 5 months ago
- [ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling☆511Nov 18, 2025Updated 3 months ago
- ☆34Jun 2, 2023Updated 2 years ago
- ☆18Aug 19, 2024Updated last year
- Diagnosing Error in Temporal Action Detectors (ECCV 2018)☆75Nov 14, 2021Updated 4 years ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Sep 25, 2023Updated 2 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 8 months ago
- ☆20Sep 5, 2024Updated last year
- ☆42Apr 7, 2024Updated last year
- Unifying Specialized Visual Encoders for Video Language Models☆25Nov 22, 2025Updated 3 months ago
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆97Jan 14, 2025Updated last year
- The official implementation of 'Proposal-based Multiple Instance Learning for Weakly-supervised Temporal Action Localization' (CVPR 2023)☆42Jun 1, 2023Updated 2 years ago
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.☆29Oct 18, 2024Updated last year
- codes for Uncovering Hidden Challenges in Query-Based Video Moment Retrieval☆20Sep 7, 2020Updated 5 years ago
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization☆61Dec 6, 2025Updated 2 months ago
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling☆206Dec 27, 2023Updated 2 years ago
- Official Implementation of SnAG (CVPR 2024)☆56Apr 26, 2025Updated 10 months ago
- [CVPR 2025] Official implementation of the paper "DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for T…☆26Jul 1, 2025Updated 8 months ago
- ☆22May 8, 2023Updated 2 years ago
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability☆106Nov 28, 2024Updated last year
- [CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆108May 29, 2025Updated 9 months ago
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos☆33May 27, 2025Updated 9 months ago
- Video shot transition detection☆25Mar 9, 2023Updated 2 years ago
- UBnormal: New Benchmark for Supervised Open-Set Video Anomaly Detection☆101Sep 29, 2022Updated 3 years ago
- ☆26Mar 20, 2023Updated 2 years ago
- ☆21May 11, 2025Updated 9 months ago
- [CVPR 2024] Context-Guided Spatio-Temporal Video Grounding☆66Jun 28, 2024Updated last year
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆752Oct 8, 2024Updated last year
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)☆34Apr 17, 2025Updated 10 months ago
- [CVPR'24] RTracker: Recoverable Tracking via PN Tree Structured Memory☆28Jun 18, 2024Updated last year
- Narrative movie understanding benchmark☆76Jun 11, 2025Updated 8 months ago
- ☆30Jul 4, 2024Updated last year
- A fullstack Rust + React chat app using open-source Llama language models☆33Sep 8, 2023Updated 2 years ago