divyam3897 / I2M2
I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)
☆22 Updated last year
Alternatives and similar repositories for I2M2
Users who are interested in I2M2 are comparing it to the libraries listed below.
- Active Learning in the era of Foundation Models ☆11 Updated 8 months ago
- MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants ☆41 Updated 3 months ago
- [ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models ☆98 Updated last year
- [NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy ☆74 Updated 2 years ago
- Accompanying code for "Analyzing Vision Transformers in Class Embedding Space" (NeurIPS '23) ☆15 Updated last year
- [NeurIPS 2023, ICMI 2023] Quantifying & Modeling Multimodal Interactions ☆84 Updated last year
- Sparse Linear Concept Embeddings ☆127 Updated 9 months ago
- Symile is a flexible, architecture-agnostic contrastive loss that enables training modality-specific representations for any number of mo… ☆46 Updated 9 months ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆55 Updated last year
- [ICLR 23] A new framework to transform any neural networks into an interpretable concept-bottleneck-model (CBM) without needing labeled c… ☆128 Updated last year
- Official implementation for NeurIPS'23 paper "Geodesic Multi-Modal Mixup for Robust Fine-Tuning" ☆36 Updated last year
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning ☆34 Updated 8 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind ☆72 Updated last year
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"… ☆31 Updated last month
- Expert-level AI radiology report evaluator ☆35 Updated 9 months ago
- MultiModN – Multimodal, Multi-Task, Interpretable Modular Networks (NeurIPS 2023) ☆35 Updated 2 years ago
- [NeurIPS 2024] RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models ☆31 Updated last year
- Code for ICLR'24 workshop ME-FoMo-How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation ☆38 Updated last year
- Official Implementation of "Geometric Multimodal Contrastive Representation Learning" (https://arxiv.org/abs/2202.03390) ☆28 Updated last year
- ☆32 Updated last year
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation ☆42 Updated last month
- ☆17 Updated 2 years ago
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models ☆77 Updated last year
- ☆70 Updated 6 months ago
- [NeurIPS 2025 D&B Spotlight] CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays ☆27 Updated 2 months ago
- [CVPR 2025] Custom Open CLIP repo to train biomedical CLIP models ☆30 Updated 9 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24] ☆61 Updated last year
- [NeurIPS 2023] Official Codebase for "Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback" ☆19 Updated 2 years ago
- KAIST medical VL research group ☆19 Updated last year
- ☆39 Updated 5 months ago