alibaba-mmai-research/Masked-Action-Recognition

Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition

☆32

Alternatives and similar repositories for Masked-Action-Recognition

Users who are interested in Masked-Action-Recognition are comparing it to the repositories listed below.

whwu95 / BIKE
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
☆156 · Sep 9, 2024 · Updated last year
alibaba-mmai-research / HiCo
CVPR2022: Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
☆18 · Aug 10, 2022 · Updated 3 years ago
MartinXM / TPS
A simple but efficient transformer model for video action recognition
☆64 · Oct 8, 2022 · Updated 3 years ago
lambert-x / video-semisup
Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022)
☆30 · Dec 1, 2022 · Updated 3 years ago
alibaba-mmai-research / DiST
ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
☆41 · Sep 25, 2023 · Updated 2 years ago
cybercore-co-ltd / AICity2022-Track3
☆10 · Jun 28, 2022 · Updated 4 years ago
Mia-YatingYu / STDD
[AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP
☆23 · Aug 5, 2025 · Updated 11 months ago
kennymckormick / TransRank
[CVPR2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit…
☆18 · Aug 1, 2022 · Updated 3 years ago
EHZ9NIWI7 / MSF-GZSSAR
Official code of the MSF model for GZSSAR (ICIG 2023)
☆13 · Jan 3, 2026 · Updated 6 months ago
alibaba-mmai-research / HyRSMPlusPlus
Code for our paper "HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition".
☆15 · Jan 3, 2023 · Updated 3 years ago
wgcban / adamae
[CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders
☆84 · Feb 2, 2024 · Updated 2 years ago
gebob19 / M2A
Code associated with "M2A: Motion Aware Attention for Accurate Video Action Recognition"
☆12 · Jan 25, 2022 · Updated 4 years ago
MCG-NJU / MGMAE
[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding
☆26 · Oct 16, 2023 · Updated 2 years ago
casper9429-kth / Siamese-Masked-Autoencoders---Learning-and-Exploration
Course: DD2412 Deep Learning Advanced at KTH Project by Casper, Magnus, and Friso Focus: Self-supervised learning and computer vision wit…
☆12 · Dec 15, 2023 · Updated 2 years ago
dipika-singhania / ICC-Semi-Supervised-TAS
Iterative Contrast-Classify For Semi-supervised Temporal Action Segmentation
☆11 · Jul 24, 2023 · Updated 3 years ago
Boeun-Kim / GL-Transformer
This is the official implementation of Global-local Motion Transformer for Unsupervised Skeleton-based Action Learning (ECCV 2022).
☆23 · Nov 6, 2023 · Updated 2 years ago
alibaba-mmai-research / MoLo
Code for our CVPR 2023 paper "MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition".
☆52 · Mar 7, 2024 · Updated 2 years ago
YonghaoHe / DSLA
official code for Dynamic Smooth Label Assignment
☆12 · Oct 5, 2022 · Updated 3 years ago
yhZhai / SOAR
[ICCV 2023] Official implementation of paper "SOAR: Scene-debiasing Open-set Action Recognition".
☆12 · Dec 23, 2023 · Updated 2 years ago
CVIR / TCL
Semi-Supervised Action Recognition with Temporal Contrastive Learning
☆59 · Mar 22, 2024 · Updated 2 years ago
Cogito2012 / DEAR
[ICCV 2021 Oral] Deep Evidential Action Recognition
☆133 · Sep 4, 2023 · Updated 2 years ago
LeeYongHyeok / DCM_vgg_transformer
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14 · Jul 2, 2020 · Updated 6 years ago
seominseok0429 / inception-I3D-NON-LOCAL
Inception-I3D, Non Local finetune, hmdb51_flow
☆15 · Oct 15, 2019 · Updated 6 years ago
UCSC-VLAA / DMAE
[CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"
☆109 · Jul 24, 2023 · Updated 3 years ago
yzfly / TCM
TCM: Temporal Correlation Module
☆17 · Apr 24, 2021 · Updated 5 years ago
kahnchana / svt
Official repository for "Self-Supervised Video Transformer" (CVPR'22)
☆109 · Jun 26, 2024 · Updated 2 years ago
sunilhoho / EVEREST
Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].
☆31 · Jun 15, 2024 · Updated 2 years ago
uark-cviu / DirecFormer
[CVPR'22] DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition
☆27 · Sep 28, 2022 · Updated 3 years ago
liupeng0606 / clip4caption
The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)
☆16 · Jan 2, 2023 · Updated 3 years ago
xjchenGit / awesome-audio-visual-deepfake
awesome-audio-visual-robustness
☆11 · Jan 27, 2024 · Updated 2 years ago
ruiwang2021 / mvd
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…
☆135 · May 21, 2023 · Updated 3 years ago
MCG-NJU / VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,777 · Dec 8, 2023 · Updated 2 years ago
ruiyan1995 / Interactive_Fusion_for_CAR
☆16 · Jan 6, 2025 · Updated last year
titania7777 / UCF101FewShot
Testing code for few-shot action recognition
☆11 · Jan 12, 2021 · Updated 5 years ago
csinva / cookiecutter-ml-research
A logical, reasonably standardized, but flexible project structure for conducting ml research 🍪
☆19 · Apr 9, 2026 · Updated 3 months ago
haithanhp / mixconv_pytorch
☆12 · Aug 23, 2019 · Updated 6 years ago
OpenGVLab / efficient-video-recognition
☆184 · Aug 20, 2022 · Updated 3 years ago
alibaba-mmai-research / TAdaConv
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, vi…
☆246 · Aug 23, 2023 · Updated 2 years ago
linziyi96 / st-adapter
☆87 · May 8, 2023 · Updated 3 years ago