[CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
☆41 · Jul 9, 2024 · Updated last year
Alternatives and similar repositories for AdaTAD
Users who are interested in AdaTAD are comparing it to the libraries listed below.
- Code for the paper, Temporal Action Localization with Enhanced Instant Discriminability ☆27 · Mar 25, 2024 · Updated 2 years ago
- OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch. ☆324 · Apr 29, 2025 · Updated 11 months ago
- ☆18 · May 6, 2024 · Updated last year
- UniMD: Towards Unifying Moment retrieval and temporal action Detection ☆57 · Jul 5, 2024 · Updated last year
- [CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection ☆18 · Oct 3, 2024 · Updated last year
- Code release for ActionFormer (ECCV 2022) ☆555 · Apr 11, 2024 · Updated 2 years ago
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization ☆61 · Dec 6, 2025 · Updated 4 months ago
- ☆22 · May 8, 2023 · Updated 2 years ago
- The suite of modeling video with Mamba ☆294 · May 14, 2024 · Updated last year
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling ☆212 · Dec 27, 2023 · Updated 2 years ago
- Diagnosing Error in Temporal Action Detectors (ECCV 2018) ☆77 · Nov 14, 2021 · Updated 4 years ago
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025) ☆37 · Apr 17, 2025 · Updated 11 months ago
- [ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling ☆516 · Nov 18, 2025 · Updated 4 months ago
- [NeurIPS 2025] Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM ☆23 · Feb 10, 2026 · Updated 2 months ago
- Official implementation for ECCV paper "Towards Open Set Video Anomaly Detection" ☆16 · Feb 11, 2023 · Updated 3 years ago
- [CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate E… ☆27 · Jun 26, 2024 · Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning ☆41 · Sep 25, 2023 · Updated 2 years ago
- [CVPR 2025] Official implementation of the paper "DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for T… ☆27 · Jul 1, 2025 · Updated 9 months ago
- Code for CVPR 2019 paper ☆12 · Apr 26, 2019 · Updated 6 years ago
- PyTorch codebase for Capturing label characteristics in VAEs ☆12 · May 1, 2021 · Updated 4 years ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking ☆771 · Oct 8, 2024 · Updated last year
- ☆42 · Apr 7, 2024 · Updated 2 years ago
- Unifying Specialized Visual Encoders for Video Language Models ☆25 · Nov 22, 2025 · Updated 4 months ago
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability ☆106 · Nov 28, 2024 · Updated last year
- Unsupervised Activity Segmentation by Joint Representation Learning and Online Clustering (CVPR 2022) ☆13 · Sep 22, 2023 · Updated 2 years ago
- A guide to structured generation using constrained decoding ☆14 · Jun 9, 2024 · Updated last year
- [NeurIPS 2025] PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer ☆29 · Oct 2, 2025 · Updated 6 months ago
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition ☆98 · Jan 14, 2025 · Updated last year
- Code for Uncovering Hidden Challenges in Query-Based Video Moment Retrieval ☆20 · Sep 7, 2020 · Updated 5 years ago
- Official Implementation of SnAG (CVPR 2024) ☆58 · Apr 26, 2025 · Updated 11 months ago
- Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization (IJCAI-22) ☆12 · Oct 11, 2022 · Updated 3 years ago
- 🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024) ☆90 · Jul 2, 2024 · Updated last year
- [CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark ☆55 · Mar 19, 2022 · Updated 4 years ago
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos ☆34 · May 27, 2025 · Updated 10 months ago
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding ☆2,241 · Mar 25, 2026 · Updated 2 weeks ago
- 🧠 VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning (ICLR 2026) ☆318 · Feb 8, 2026 · Updated 2 months ago
- ☆34 · Jun 2, 2023 · Updated 2 years ago
- DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization ☆18 · Sep 28, 2023 · Updated 2 years ago
- [ICCV 2023] Official PyTorch implementation of the paper "DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion" ☆37 · Mar 30, 2023 · Updated 3 years ago