sangminwoo / ActionMAELinks
[AAAI 2023 Oral] Official pytorch implementation of "Towards Good Practices for Missing Modality Robust Action Recognition"
☆22Updated 2 years ago
Alternatives and similar repositories for ActionMAE
Users that are interested in ActionMAE are comparing it to the libraries listed below
Sorting:
- The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024☆26Updated last year
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆28Updated 2 years ago
- ☆26Updated 2 years ago
- Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023)☆28Updated last year
- Accepted at ICCV '23☆13Updated last year
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)☆24Updated 2 years ago
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆40Updated 2 years ago
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆17Updated last year
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆52Updated 2 years ago
- Placeholder☆10Updated 2 years ago
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆68Updated 5 months ago
- ☆82Updated 2 years ago
- [CVPR'23] Official PyTorch implementation of Distilling Self-Supervised Vision Transformers for Weakly-Supervised Few-Shot Classification…☆44Updated last year
- [ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling☆72Updated last year
- [ECCV2024] Nonverbal Interaction Detection☆27Updated 10 months ago
- [NeurIPS 2022] code for the paper, SemMAE: Semantic-guided masking for learning masked autoencoders☆38Updated 2 years ago
- Proportional Amplitude Spectrum Training Augmentation for Synthetic to Real Domain Generalization☆21Updated last year
- Codes for ECCV2022 paper - contrastive deep supervision☆69Updated 2 years ago
- Official repository of "Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer", AAAI 2024☆23Updated last year
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆45Updated last year
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""☆14Updated last year
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆49Updated 10 months ago
- Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23☆215Updated last year
- ☆27Updated 3 years ago
- Code for dmrnet☆26Updated last month
- Official code for "BoMD: Bag of Multi-label Descriptors for Noisy Chest X-ray Classification"☆26Updated last year
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".☆37Updated 8 months ago
- [AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP☆15Updated 3 weeks ago
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆16Updated 10 months ago
- TupleInfoNCE ICCV21☆17Updated 3 years ago