ruiwang2021 / mvd
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
☆110 Updated last year
Related projects
Alternatives and complementary repositories for mvd
- ☆169 Updated 2 years ago
- [CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames ☆32 Updated 4 months ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning" ☆85 Updated 2 months ago
- This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection ☆71 Updated last year
- [CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers ☆172 Updated last year
- Official Implementation of SnAG (CVPR 2024) ☆37 Updated 3 weeks ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition" ☆61 Updated 7 months ago
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization ☆54 Updated last year
- Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers" ☆104 Updated last year
- ☆44 Updated 10 months ago
- BEAR: a new BEnchmark on video Action Recognition ☆42 Updated 7 months ago
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling ☆165 Updated 10 months ago
- [ECCV 2022] Official PyTorch Implementation of the paper: "Zero-Shot Temporal Action Detection via Vision-Language Prompting" ☆98 Updated last year
- [CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection ☆82 Updated last year
- ☆187 Updated 2 years ago
- BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection ☆49 Updated last year
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding ☆45 Updated last year
- Code for Diffusion Action Segmentation (ICCV 2023) ☆52 Updated last year
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners". ☆249 Updated 7 months ago
- UniMD: Towards Unifying Moment retrieval and temporal action Detection ☆38 Updated 4 months ago
- OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch. ☆187 Updated last month
- Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers". ☆90 Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE for Action Detection ☆51 Updated last year
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer ☆294 Updated 7 months ago
- The suite of modeling video with Mamba ☆238 Updated 6 months ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023] ☆90 Updated 4 months ago
- ☆36 Updated 7 months ago
- ☆106 Updated 9 months ago
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector ☆50 Updated last year
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022 ☆145 Updated last year