sming256/AdaTAD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sming256/AdaTAD)

sming256 / AdaTAD

[CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames

☆42

Alternatives and similar repositories for AdaTAD

Users that are interested in AdaTAD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dingfengshi / tridetplus
View on GitHub
Code for the paper, Temporal Action Localization with Enhanced Instant Discriminability
☆29Mar 25, 2024Updated 2 years ago
sming256 / OpenTAD
View on GitHub
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
☆340Jul 14, 2026Updated last week
yingsen1 / UniMD
View on GitHub
UniMD: Towards Unifying Moment retrieval and temporal action Detection
☆57Jul 5, 2024Updated 2 years ago
Alvin-Zeng / temporal-robustness-benchmark
View on GitHub
☆20May 6, 2024Updated 2 years ago
sming256 / ETAD
View on GitHub
[CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection
☆19Oct 3, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
happyharrycn / actionformer_release
View on GitHub
Code release for ActionFormer (ECCV 2022)
☆571Apr 11, 2024Updated 2 years ago
TuanTNG / TemporalMaxer
View on GitHub
TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization
☆65Dec 6, 2025Updated 7 months ago
OpenGVLab / video-mamba-suite
View on GitHub
The suite of modeling video with Mamba
☆295May 14, 2024Updated 2 years ago
dingfengshi / TriDet
View on GitHub
[CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling
☆219Dec 27, 2023Updated 2 years ago
zhou745 / GauFuse_WSTAL
View on GitHub
☆21May 8, 2023Updated 3 years ago
OpenGVLab / VideoChat-Flash
View on GitHub
[ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling
☆527Updated this week
HumamAlwassel / DETAD
View on GitHub
Diagnosing Error in Temporal Action Detectors (ECCV 2018)
☆78Nov 14, 2021Updated 4 years ago
Zhuo-Cao / FlashVTG
View on GitHub
FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)
☆39Apr 17, 2025Updated last year
RenHuan1999 / CVPR2023_P-MIL
View on GitHub
The official implementation of 'Proposal-based Multiple Instance Learning for Weakly-supervised Temporal Action Localization' (CVPR 2023)
☆44Jun 1, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
alibaba-mmai-research / DiST
View on GitHub
ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
☆41Sep 25, 2023Updated 2 years ago
saakur / EventSegmentation
View on GitHub
Code for CVPR 2019 paper
☆12Apr 26, 2019Updated 7 years ago
thwjoy / ccvae_pytorch
View on GitHub
Pytorch codebase for Capturing label characteristics in VAEs
☆13May 1, 2021Updated 5 years ago
OpenGVLab / VideoMAEv2
View on GitHub
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
☆803Oct 8, 2024Updated last year
HJYao00 / Side4Video
View on GitHub
☆42Apr 7, 2024Updated 2 years ago
trquhuytin / TOT-CVPR22
View on GitHub
Unsupervised Activity Segmentation by Joint Representation Learning and Online Clustering (CVPR 2022)
☆12Sep 22, 2023Updated 2 years ago
TimeMarker-LLM / TimeMarker
View on GitHub
A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability
☆107Nov 28, 2024Updated last year
princetonvisualai / merv
View on GitHub
Unifying Specialized Visual Encoders for Video Language Models
☆25Nov 22, 2025Updated 8 months ago
mayu-ot / hidden-challenges-MR
View on GitHub
codes for Uncovering Hidden Challenges in Query-Based Video Moment Retrieval
☆20Sep 7, 2020Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Run542968 / GAP
View on GitHub
☆11Oct 13, 2024Updated last year
Visual-AI / FROSTER
View on GitHub
[ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition
☆101Jan 14, 2025Updated last year
zhenyingfang / Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
View on GitHub
Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation
☆593Updated this week
fmu2 / snag_release
View on GitHub
Official Implementation of SnAG (CVPR 2024)
☆59Apr 26, 2025Updated last year
Dotori-HJ / DiGIT
View on GitHub
[CVPR 2025] Official implementation of the paper "DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for T…
☆32Jul 1, 2025Updated last year
xlliu7 / MUSES
View on GitHub
[CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark
☆55Mar 19, 2022Updated 4 years ago
OpenGVLab / InternVideo
View on GitHub
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
☆2,339Jul 2, 2026Updated 3 weeks ago
yeliudev / R2-Tuning
View on GitHub
🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)
☆91Jul 2, 2024Updated 2 years ago
XiaojunTang22 / ICCV2023-DDGNet
View on GitHub
DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization
☆18Sep 28, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sejong-rcv / PVLR
View on GitHub
[ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization
☆13Oct 8, 2024Updated last year
md-mohaiminul / TranS4mer
View on GitHub
☆34Jun 2, 2023Updated 3 years ago
sauradip / DiffusionTAD
View on GitHub
[ICCV 2023] Official PyTorch implementation of the paper "DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion"
☆37Mar 30, 2023Updated 3 years ago
yangle15 / DyFADet-pytorch
View on GitHub
☆32Jul 4, 2024Updated 2 years ago
farewellthree / STAN
View on GitHub
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
☆107Jan 28, 2024Updated 2 years ago
alpargun / SlowFast
View on GitHub
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆12Jul 26, 2024Updated last year
DAVEISHAN / TimeBalance
View on GitHub
Placeholder
☆10Jul 17, 2023Updated 3 years ago