[CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
☆41Jul 9, 2024Updated last year
Alternatives and similar repositories for AdaTAD
Users that are interested in AdaTAD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper, Temporal Action Localization with Enhanced Instant Discriminability☆29Mar 25, 2024Updated 2 years ago
- OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.☆330Apr 29, 2025Updated last year
- ☆20May 6, 2024Updated 2 years ago
- UniMD: Towards Unifying Moment retrieval and temporal action Detection☆57Jul 5, 2024Updated last year
- [CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection☆18Oct 3, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code release for ActionFormer (ECCV 2022)☆559Apr 11, 2024Updated 2 years ago
- ☆22May 8, 2023Updated 3 years ago
- The suite of modeling video with Mamba☆294May 14, 2024Updated 2 years ago
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling☆214Dec 27, 2023Updated 2 years ago
- Diagnosing Error in Temporal Action Detectors (ECCV 2018)☆78Nov 14, 2021Updated 4 years ago
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)☆37Apr 17, 2025Updated last year
- [ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling☆523Nov 18, 2025Updated 6 months ago
- [NeurIPS 2025] Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM☆24Feb 10, 2026Updated 3 months ago
- Official implementation for ECCV paper "Towards Open Set Video Anomaly Detection"☆16Feb 11, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate E…☆27Jun 26, 2024Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Sep 25, 2023Updated 2 years ago
- Code for CVPR 2019 paper☆12Apr 26, 2019Updated 7 years ago
- Pytorch codebase for Capturing label characteristics in VAEs☆13May 1, 2021Updated 5 years ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆784Oct 8, 2024Updated last year
- ☆42Apr 7, 2024Updated 2 years ago
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability☆106Nov 28, 2024Updated last year
- Unsupervised Activity Segmentation by Joint Representation Learning and Online Clustering (CVPR 2022)☆13Sep 22, 2023Updated 2 years ago
- ☆11Oct 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆101Jan 14, 2025Updated last year
- codes for Uncovering Hidden Challenges in Query-Based Video Moment Retrieval☆20Sep 7, 2020Updated 5 years ago
- Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation☆585Updated this week
- Official Implementation of SnAG (CVPR 2024)☆59Apr 26, 2025Updated last year
- Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization(IJCAI-22)☆12Oct 11, 2022Updated 3 years ago
- [NeurIPS 2025] PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer☆32Oct 2, 2025Updated 7 months ago
- [ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization☆13Oct 8, 2024Updated last year
- 🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)☆92Jul 2, 2024Updated last year
- [CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark☆55Mar 19, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos☆36May 27, 2025Updated 11 months ago
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding☆2,264Mar 25, 2026Updated last month
- ☆34Jun 2, 2023Updated 2 years ago
- DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization☆18Sep 28, 2023Updated 2 years ago
- [ICCV 2023] Official PyTorch implementation of the paper "DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion"☆37Mar 30, 2023Updated 3 years ago
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆107Jan 28, 2024Updated 2 years ago
- Placeholder☆10Jul 17, 2023Updated 2 years ago