The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024
☆54 · Jun 28, 2024 · Updated last year
Alternatives and similar repositories for MMPareto_ICML2024
Users who are interested in MMPareto_ICML2024 are comparing it to the libraries listed below
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024 ☆59 · Nov 5, 2024 · Updated last year
- PMR: Prototypical Modal Rebalance for Multimodal Learning ☆46 · Mar 10, 2023 · Updated 2 years ago
- The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL) ☆311 · Sep 22, 2025 · Updated 5 months ago
- Codebase of 'MADE-for-ASD: A Multi-Atlas Deep Ensemble Network for Diagnosing Autism Spectrum Disorder' ☆12 · Jun 3, 2025 · Updated 9 months ago
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing ☆27 · Jul 15, 2022 · Updated 3 years ago
- The official repository of the paper "DeepM2CDL: Deep Multi-scale Multi-modal Convolutional Dictionary Learning Network" from IEEE Transa… ☆54 · Apr 1, 2024 · Updated last year
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023) ☆12 · Jun 1, 2023 · Updated 2 years ago
- [ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation". ☆28 · Jan 26, 2024 · Updated 2 years ago
- The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024 ☆30 · Jul 30, 2024 · Updated last year
- Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight) ☆89 · Jul 25, 2024 · Updated last year
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation ☆27 · Jul 18, 2025 · Updated 7 months ago
- Accepted at ICCV '23 ☆15 · Oct 4, 2023 · Updated 2 years ago
- Convolutional Initialization for Data-Efficient Vision Transformers ☆16 · Dec 9, 2025 · Updated 2 months ago
- Decoupling common and unique representations for multimodal self-supervised learning ☆72 · Aug 14, 2024 · Updated last year
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024 ☆18 · Oct 11, 2024 · Updated last year
- ☆14 · Jun 17, 2024 · Updated last year
- ☆36 · Jul 9, 2025 · Updated 7 months ago
- Improving autism identification with multisite data via site-dependence minimisation and second-order functional connectivity (TMI, 2022) ☆38 · Jul 19, 2023 · Updated 2 years ago
- [CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception ☆36 · Jun 17, 2023 · Updated 2 years ago
- A curated list of balanced multimodal learning methods. ☆159 · Updated this week
- Code release of paper "ForkMerge: Mitigating Negative Transfer in Auxiliary-Task Learning" (NeurIPS 2023) ☆17 · Dec 30, 2023 · Updated 2 years ago
- [ECCV 2024] Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision ☆43 · May 23, 2025 · Updated 9 months ago
- Towards Long Form Audio-visual Video Understanding ☆15 · Jan 16, 2026 · Updated last month
- [ICCV 2025] SAS: Segment Any 3D Scene with Integrated 2D Priors ☆31 · Jun 25, 2025 · Updated 8 months ago
- ☆44 · May 20, 2025 · Updated 9 months ago
- Model implementation and trained network for "A Two-Step Disentanglement Method" by Naama Hadad, Lior Wolf and Moni Shahar ☆21 · Mar 21, 2018 · Updated 7 years ago
- The official code of IEEE S&P 2024 paper "Why Does Little Robustness Help? A Further Step Towards Understanding Adversarial Transferabili… ☆20 · Aug 22, 2024 · Updated last year
- ☆26 · Mar 27, 2025 · Updated 11 months ago
- ☆23 · Apr 10, 2023 · Updated 2 years ago
- [CVPR 2024] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities ☆101 · Mar 13, 2024 · Updated last year
- ☆23 · Nov 15, 2022 · Updated 3 years ago
- Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge ☆61 · Mar 19, 2025 · Updated 11 months ago
- ☆34 · Jul 25, 2024 · Updated last year
- Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning ☆26 · Nov 15, 2023 · Updated 2 years ago
- A multimodal (i.e., Sentinel-2, Sentinel-1, and SRTM) remote sensing dataset in Hunan, China. ☆33 · Mar 31, 2024 · Updated last year
- (ICLR 2024, CVPR 2024) SparseFormer ☆75 · Nov 10, 2024 · Updated last year
- Code repository for "Parameter Efficient Self-supervised Geospatial Domain Adaptation", CVPR 2024 ☆36 · Jul 29, 2024 · Updated last year
- An implementation of SignGraph: A Sign Sequence is Worth Graphs of Nodes (CVPR2024) ☆32 · Nov 27, 2025 · Updated 3 months ago
- Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023) ☆32 · Jan 24, 2024 · Updated 2 years ago