Zhiyuan-Li-John / MuCR
MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities
☆15Updated 3 months ago
Alternatives and similar repositories for MuCR
Users that are interested in MuCR are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024 Spotlight] Code for the paper "Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts"☆51Updated 6 months ago
- ☆17Updated 9 months ago
- [CVPR 2025] Offical implementation of the paper "Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters The…☆17Updated 2 months ago
- Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆14Updated 5 months ago
- Multimodal Instruction Tuning with Conditional Mixture of LoRA (ACL 2024)☆20Updated 9 months ago
- Enhance Vision-Language Alignment with Noise (AAAI 2025)☆23Updated 5 months ago
- ☆17Updated 4 months ago
- [ICLR 2023] Official code repository for "Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning"☆59Updated last year
- Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution☆50Updated last year
- [CVPR 2024] Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification☆31Updated last year
- ☆11Updated last year
- [IJCV2025] https://arxiv.org/abs/2304.04521☆14Updated 3 months ago
- Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning☆25Updated last year
- [ICCVW2023] Robust Asymmetric Loss for Multi-Label Long-Tailed Learning☆18Updated last year
- ☆10Updated 4 months ago
- AdaMoLE: Adaptive Mixture of LoRA Experts☆30Updated 7 months ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆48Updated 2 months ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆16Updated 6 months ago
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding☆39Updated 5 months ago
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"…☆20Updated 2 months ago
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"☆14Updated last year
- [CVPR 2024] Open-Set Domain Adaptation for Semantic Segmentation☆41Updated 9 months ago
- ☆22Updated 11 months ago
- ☆16Updated 7 months ago
- ☆13Updated 2 years ago
- ☆10Updated 2 months ago
- Code for paper 'Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity…☆12Updated last year
- Towards Unified and Effective Domain Generalization☆31Updated last year
- Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment, arXiv 2024 / CVPR 2025☆27Updated 2 months ago
- Domain Generalization through Distilling CLIP with Language Guidance☆29Updated last year