[CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
☆42 · Jul 9, 2024 · Updated last year
Alternatives and similar repositories for AdaTAD
Users that are interested in AdaTAD are comparing it to the libraries listed below.
- OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch. ☆334 · Apr 29, 2025 · Updated last year
- ☆20 · May 6, 2024 · Updated 2 years ago
- UniMD: Towards Unifying Moment retrieval and temporal action Detection ☆57 · Jul 5, 2024 · Updated last year
- [CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection ☆19 · Oct 3, 2024 · Updated last year
- Code release for ActionFormer (ECCV 2022) ☆563 · Apr 11, 2024 · Updated 2 years ago
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization ☆65 · Dec 6, 2025 · Updated 6 months ago
- ☆22 · May 8, 2023 · Updated 3 years ago
- The suite of modeling video with Mamba ☆295 · May 14, 2024 · Updated 2 years ago
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling ☆217 · Dec 27, 2023 · Updated 2 years ago
- Diagnosing Error in Temporal Action Detectors (ECCV 2018) ☆78 · Nov 14, 2021 · Updated 4 years ago
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025) ☆39 · Apr 17, 2025 · Updated last year
- [ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling ☆527 · Nov 18, 2025 · Updated 6 months ago
- The official implementation of 'Proposal-based Multiple Instance Learning for Weakly-supervised Temporal Action Localization' (CVPR 2023) ☆43 · Jun 1, 2023 · Updated 3 years ago
- [NeurIPS 2025] Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM ☆25 · Feb 10, 2026 · Updated 4 months ago
- [CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate E… ☆30 · Jun 26, 2024 · Updated last year
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning ☆41 · Sep 25, 2023 · Updated 2 years ago
- [CVPR 2025] Official implementation of the paper "DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for T… ☆29 · Jul 1, 2025 · Updated 11 months ago
- Pytorch codebase for Capturing label characteristics in VAEs ☆13 · May 1, 2021 · Updated 5 years ago
- ☆42 · Apr 7, 2024 · Updated 2 years ago
- Unifying Specialized Visual Encoders for Video Language Models ☆25 · Nov 22, 2025 · Updated 6 months ago
- Official implementation of paper AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding ☆91 · Apr 21, 2026 · Updated last month
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability ☆107 · Nov 28, 2024 · Updated last year
- Unsupervised Activity Segmentation by Joint Representation Learning and Online Clustering (CVPR 2022) ☆13 · Sep 22, 2023 · Updated 2 years ago
- ☆11 · Oct 13, 2024 · Updated last year
- codes for Uncovering Hidden Challenges in Query-Based Video Moment Retrieval ☆20 · Sep 7, 2020 · Updated 5 years ago
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition ☆101 · Jan 14, 2025 · Updated last year
- Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation ☆589 · May 29, 2026 · Updated 2 weeks ago
- Official Implementation of SnAG (CVPR 2024) ☆60 · Apr 26, 2025 · Updated last year
- Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization (IJCAI-22) ☆12 · Oct 11, 2022 · Updated 3 years ago
- [NeurIPS 2025] PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer ☆33 · Oct 2, 2025 · Updated 8 months ago
- [ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization ☆13 · Oct 8, 2024 · Updated last year
- 🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024) ☆92 · Jul 2, 2024 · Updated last year
- [CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark ☆55 · Mar 19, 2022 · Updated 4 years ago
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding ☆2,281 · May 26, 2026 · Updated 2 weeks ago
- ☆34 · Jun 2, 2023 · Updated 3 years ago
- 🧠 VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning (ICLR 2026) ☆340 · Feb 8, 2026 · Updated 4 months ago
- DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization ☆18 · Sep 28, 2023 · Updated 2 years ago
- [ICCV 2023] Official PyTorch implementation of the paper "DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion" ☆37 · Mar 30, 2023 · Updated 3 years ago
- Placeholder ☆10 · Jul 17, 2023 · Updated 2 years ago