MCG-NJU/VideoMAE-Action-Detection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MCG-NJU/VideoMAE-Action-Detection)

MCG-NJU / VideoMAE-Action-Detection

[NeurIPS 2022 Spotlight] VideoMAE for Action Detection

☆70

Alternatives and similar repositories for VideoMAE-Action-Detection

Users that are interested in VideoMAE-Action-Detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MCG-NJU / EVAD
View on GitHub
[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement
☆39Sep 27, 2023Updated 2 years ago
MCG-NJU / VideoMAE
View on GitHub
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,777Dec 8, 2023Updated 2 years ago
OpenGVLab / VideoMAEv2
View on GitHub
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
☆805Oct 8, 2024Updated last year
MCG-NJU / STMixer
View on GitHub
[CVPR 2023] STMixer: A One-Stage Sparse Action Detector
☆64May 18, 2023Updated 3 years ago
ruiwang2021 / mvd
View on GitHub
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…
☆135May 21, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
TencentYoutuResearch / ActionDetection-LSTC
View on GitHub
LSTC: Boosting Atomic Action Detection with Long-Short-Term Context
☆10Sep 1, 2022Updated 3 years ago
4paradigm-CV / SE-STAD
View on GitHub
☆10Jan 3, 2023Updated 3 years ago
jolin830 / SlowFast-Meet-ViT
View on GitHub
We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …
☆14Nov 11, 2024Updated last year
HCPLab-SYSU / STKET
View on GitHub
Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)
☆19Mar 13, 2024Updated 2 years ago
MCG-NJU / MixSort
View on GitHub
[ICCV2023] MixSort: The Customized Tracker in SportsMOT
☆97Aug 21, 2023Updated 2 years ago
MCG-NJU / NeuralSolver
View on GitHub
[ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling
☆21Jul 7, 2025Updated last year
amazon-science / tubelet-transformer
View on GitHub
This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection
☆96Apr 14, 2023Updated 3 years ago
MCG-NJU / JoMoLD
View on GitHub
[ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing
☆27Jul 15, 2022Updated 4 years ago
facebookresearch / MeMViT
View on GitHub
Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022
☆155Nov 30, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yjh0410 / YOWOv2
View on GitHub
The second generation of YOWO action detector.
☆292May 9, 2024Updated 2 years ago
CAMMA-public / MultiBypass140
View on GitHub
☆22Sep 19, 2025Updated 10 months ago
joslefaure / HIT
View on GitHub
Official Implementation of our WACV2023 paper: “Holistic Interaction Transformer Network for Action Detection”
☆72Jan 9, 2025Updated last year
OpenGVLab / InternLMM
View on GitHub
☆16Jul 6, 2023Updated 3 years ago
Siyu-C / ACAR-Net
View on GitHub
[CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization
☆215Oct 8, 2021Updated 4 years ago
MCG-NJU / CRCNN-Action
View on GitHub
Context-aware RCNN: a Baseline for Action Detection in Videos
☆51Oct 13, 2020Updated 5 years ago
MCG-NJU / ZeroI2V
View on GitHub
[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
☆23Jul 29, 2024Updated 2 years ago
FDC-WuWeb / Attention3d-codebase
View on GitHub
Implementation of 3D attention mechanisms based on https://github.com/LeftAttention/Attention-Codebase. Thanks to LeftAttetnion for shari…
☆12Feb 22, 2022Updated 4 years ago
dairui01 / PDAN
View on GitHub
[WACV2021] Implementation of Pyramid Dilated Attention Network (PDAN)
☆21May 18, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
MCG-NJU / SPLAM
View on GitHub
[ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model
☆24Nov 1, 2024Updated last year
yjh0410 / YOWOF
View on GitHub
You Only Watch One Frame for Online Spatio-Temporal Action Detection
☆37Jun 7, 2023Updated 3 years ago
wlin-at / MAXI
View on GitHub
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)
☆31Sep 5, 2023Updated 2 years ago
haoyanbin918 / Attention-in-Attention
View on GitHub
☆12Aug 5, 2022Updated 3 years ago
mondalanindya / MSQNet
View on GitHub
Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]
☆24Oct 20, 2023Updated 2 years ago
facebookresearch / hiera
View on GitHub
Hiera: A fast, powerful, and simple hierarchical vision transformer.
☆1,074Mar 2, 2024Updated 2 years ago
yjh0410 / AVA_Dataset
View on GitHub
download AVA dataset
☆23Sep 5, 2023Updated 2 years ago
XingruiWang / DynSuperCLEVR
View on GitHub
A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions withi…
☆20Apr 23, 2025Updated last year
OpenGVLab / unmasked_teacher
View on GitHub
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
☆348May 27, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
lllcho / LabelV
View on GitHub
视频分类标注、视频时空标注
☆47Aug 24, 2023Updated 2 years ago
MCG-NJU / BasicTAD
View on GitHub
BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection
☆52Jun 10, 2023Updated 3 years ago
dairui01 / MS-TCT
View on GitHub
[CVPR2022] MS-TCT
☆55Oct 8, 2022Updated 3 years ago
Wangt-CN / Code_CASC
View on GitHub
☆14Oct 14, 2019Updated 6 years ago
icedpanda / KERL
View on GitHub
[TNNLS] Knowledge Graphs and Pre-trained Language Models enhanced Representation Learning for Conversational Recommender Systems
☆13Sep 25, 2025Updated 10 months ago
OpenGVLab / VideoMamba
View on GitHub
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
☆1,121Jul 6, 2024Updated 2 years ago
ydk122024 / CCIM
View on GitHub
[CVPR2023] Context De-confounded Emotion Recognition
☆18Jul 23, 2023Updated 3 years ago