Duplums / CoMMLinks
[ICLR 2025] Multi-modal representation learning of shared, unique and synergistic features between modalities
☆47Updated 5 months ago
Alternatives and similar repositories for CoMM
Users that are interested in CoMM are comparing it to the libraries listed below
Sorting:
- The official implementation of CMAE https://arxiv.org/abs/2207.13532 and https://ieeexplore.ieee.org/document/10330745☆109Updated last year
- Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution☆58Updated last year
- [ICASSP 2025] Open-source code for the paper "Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification"☆60Updated this week
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆16Updated last year
- The official code repository of ShaSpec model from CVPR 2023 [paper](https://arxiv.org/pdf/2307.14126) "Multi-modal Learning with Missing…☆77Updated 6 months ago
- Pytorch implementation of Swin MAE https://arxiv.org/abs/2212.13805☆98Updated 3 months ago
- Official PyTorch Implementation of Guarding Barlow Twins Against Overfitting with Mixed Samples☆19Updated last year
- Pytorch implementation of "Test-time Adaption against Multi-modal Reliability Bias".☆40Updated 9 months ago
- Decoupling common and unique representations for multimodal self-supervised learning☆67Updated last year
- Official PyTorch Implementation for Active Prompt Learning in Vision Language Models☆37Updated last year
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023☆17Updated 2 years ago
- This is official implementation of "Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise"…☆20Updated 7 months ago
- Adaptation of vision-language models (CLIP) to downstream tasks using local and global prompts.☆48Updated 3 months ago
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆118Updated last year
- Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23☆218Updated last year
- The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".☆82Updated 5 months ago
- ☆20Updated 6 months ago
- [ICML 2023] Provable Dynamic Fusion for Low-Quality Multimodal Data☆111Updated 3 months ago
- [ECCV 2022] What to Hide from Your Students: Attention-Guided Masked Image Modeling☆74Updated last year
- [CVPR 2024] TEA: Test-time Energy Adaptation☆71Updated last year
- Some Useful Tools Code☆16Updated 3 months ago
- ☆60Updated last year
- ☆56Updated last year
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆56Updated 11 months ago
- This is an official implementation for PROMPT-CAM: A Simpler Interpretable Transformer for Fine-Grained Analysis (CVPR'25)☆47Updated 6 months ago
- ☆32Updated 10 months ago
- [ECCV 2024] TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data (an official implementation)☆72Updated 6 months ago
- A curated publication list on evidential deep learning.☆136Updated 6 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆78Updated 2 years ago
- Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]☆31Updated 11 months ago