The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024
☆54Jun 28, 2024Updated last year
Alternatives and similar repositories for MMPareto_ICML2024
Users that are interested in MMPareto_ICML2024 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆60Nov 5, 2024Updated last year
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆46Mar 10, 2023Updated 3 years ago
- A python implement for Certifiable Robust Multi-modal Training☆19Jun 21, 2025Updated 9 months ago
- The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)☆310Sep 22, 2025Updated 6 months ago
- [ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".☆28Jan 26, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing☆27Jul 15, 2022Updated 3 years ago
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation☆29Jul 18, 2025Updated 8 months ago
- Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)☆90Jul 25, 2024Updated last year
- The repo for "Diagnosing and Re-learning for Balanced Multi-modal Learning", ECCV 2024☆30Jul 30, 2024Updated last year
- [ACM Computing Survey 2025] Recent Advances of Foundation Language Models-based Continual Learning: A Survey☆26Oct 6, 2025Updated 5 months ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Jun 1, 2023Updated 2 years ago
- The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024☆19Sep 29, 2024Updated last year
- Future Technologies Conference 2025 - MULTIMODAL EMOTION RECOGNITION AND SENTIMENT ANALYSIS IN MULTI-PARTY CONVERSATION CONTEXTS☆13Sep 12, 2024Updated last year
- Decoupling common and unique representations for multimodal self-supervised learning☆73Aug 14, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Convolutional Initialization for Data-Efficient Vision Transformers☆16Dec 9, 2025Updated 3 months ago
- ☆36Jul 9, 2025Updated 8 months ago
- Code repository for "Parameter Efficient Self-supervised Geospatial Domain Adaptation", CVPR 2024☆36Jul 29, 2024Updated last year
- A multimodal (i.e., Sentinel-2, Sentinel-1, and SRTM) remote sensing dataset in Hunan, China.☆33Mar 31, 2024Updated last year
- ☆18Jun 26, 2025Updated 8 months ago
- [CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception☆37Jun 17, 2023Updated 2 years ago
- Official PyTorch implementation of our TGRS paper: Deep Adaptive Pansharpening via Uncertainty-aware Image Fusion.☆12Aug 7, 2023Updated 2 years ago
- The official code of IEEE S&P 2024 paper "Why Does Little Robustness Help? A Further Step Towards Understanding Adversarial Transferabili…☆20Aug 22, 2024Updated last year
- Official PyTorch code for "Vector Quantization Prompting for Continual Learning (NeurIPS2024)".☆11Oct 16, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [AAAI 2023 Oral] Official pytorch implementation of "Towards Good Practices for Missing Modality Robust Action Recognition"☆24Dec 1, 2022Updated 3 years ago
- ☆12May 12, 2025Updated 10 months ago
- Accepted at ICCV '23☆15Oct 4, 2023Updated 2 years ago
- ☆23Apr 10, 2023Updated 2 years ago
- Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)☆72Jan 4, 2026Updated 2 months ago
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆18Oct 11, 2024Updated last year
- ☆44May 20, 2025Updated 10 months ago
- The code of MetaViewer: Towards A Unified Multi-View Representation (CVPR 2023).☆10Nov 20, 2023Updated 2 years ago
- ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding☆17Aug 8, 2025Updated 7 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Codebase for "Multimodal Distillation for Egocentric Action Recognition" (ICCV 2023)☆32Jan 24, 2024Updated 2 years ago
- [MICCAI 2023] Official implementation of our MICCAI 2023 paper "Pick the Best Pre-trained Model: Towards Transferability Estimation for M…☆13Jul 27, 2023Updated 2 years ago
- Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models. [ICCV 2023 Oral]☆72Sep 6, 2023Updated 2 years ago
- ☆27Mar 27, 2025Updated 11 months ago
- Official Code for "RUN: Reversible Unfolding Network for Concealed Object Segmentation". A SOTA algorithm in camouflaged object detection…☆18Jun 3, 2025Updated 9 months ago
- [NeurIPS 2023, ICMI 2023] Quantifying & Modeling Multimodal Interactions☆87Oct 28, 2024Updated last year
- ACL 2024 (SRW), Official Codebase of our Paper: "MoExtend: Tuning New Experts for Modality and Task Extension"☆14Dec 3, 2024Updated last year