sangminwoo / ActionMAE
[AAAI 2023 Oral] Official pytorch implementation of "Towards Good Practices for Missing Modality Robust Action Recognition"
☆23 Updated 3 years ago
Alternatives and similar repositories for ActionMAE
Users who are interested in ActionMAE are comparing it to the repositories listed below.
- PMR: Prototypical Modal Rebalance for Multimodal Learning ☆42 Updated 2 years ago
- Placeholder ☆10 Updated 2 years ago
- The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024 ☆28 Updated last year
- Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023) ☆32 Updated last year
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition ☆30 Updated 2 years ago
- Accepted at ICCV '23 ☆14 Updated 2 years ago
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf) ☆25 Updated 2 years ago
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction" ☆16 Updated last year
- ☆26 Updated 2 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction ☆55 Updated 2 years ago
- [CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception ☆36 Updated 2 years ago
- ☆29 Updated 3 years ago
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer ☆71 Updated 9 months ago
- Proportional Amplitude Spectrum Training Augmentation for Synthetic to Real Domain Generalization ☆21 Updated last year
- ☆49 Updated 3 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023) ☆50 Updated last year
- [ECCV 2024] The official repo for "SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoder… ☆36 Updated last year
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders ☆41 Updated 2 years ago
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP ☆18 Updated 4 months ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners ☆106 Updated 2 years ago
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias". ☆44 Updated 11 months ago
- Code for dmrnet ☆29 Updated 4 months ago
- ICML2024-ReconBoost: Boosting Can Achieve Modality Reconcilement ☆28 Updated 7 months ago
- [ECCV 2024 oral] -C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition ☆38 Updated last year
- ☆27 Updated 2 years ago
- TupleInfoNCE ICCV21 ☆17 Updated 3 years ago
- Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation. ☆32 Updated 2 years ago
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022] ☆39 Updated last year
- [ECCV2024] Nonverbal Interaction Detection ☆28 Updated last year
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023]. ☆20 Updated last year