sangminwoo / ActionMAE
[AAAI 2023 Oral] Official pytorch implementation of "Towards Good Practices for Missing Modality Robust Action Recognition"
☆23Updated 3 years ago
Alternatives and similar repositories for ActionMAE
Users who are interested in ActionMAE are comparing it to the repositories listed below.
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆45Updated 2 years ago
- The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024☆30Updated last year
- Placeholder☆10Updated 2 years ago
- Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023)☆32Updated 2 years ago
- Accepted at ICCV '23☆14Updated 2 years ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆30Updated 2 years ago
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".☆44Updated last year
- Code for dmrnet☆29Updated 6 months ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆50Updated last year
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)☆25Updated 2 years ago
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆41Updated last year
- TupleInfoNCE ICCV21☆17Updated 3 years ago
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP☆18Updated 5 months ago
- [ECCV 2024] The official repo for "SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoder…☆37Updated last year
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆18Updated 2 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆55Updated 2 years ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆53Updated last year
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆73Updated 10 months ago
- ☆85Updated 2 years ago
- ☆30Updated 3 years ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].☆20Updated last year
- [CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception☆36Updated 2 years ago
- ICML2024-ReconBoost: Boosting Can Achieve Modality Reconcilement☆28Updated 8 months ago
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation"☆18Updated last year
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders☆42Updated 2 years ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners☆106Updated 2 years ago
- (CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization☆19Updated last year
- Official implementation for CIGN☆17Updated 2 years ago
- Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23☆226Updated 2 years ago
- ☆49Updated 3 years ago