Zhiyuan-Li-John / MuCRLinks
MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities
☆17Updated 4 months ago
Alternatives and similar repositories for MuCR
Users that are interested in MuCR are comparing it to the libraries listed below
Sorting:
- Multimodal Instruction Tuning with Conditional Mixture of LoRA (ACL 2024)☆32Updated last year
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆65Updated 2 months ago
- [ICLR 2024 Oral] Less is More: Fewer Interpretable Region via Submodular Subset Selection☆82Updated 3 months ago
- [CVPR 2025] Offical implementation of the paper "Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters The…☆21Updated 7 months ago
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆43Updated 6 months ago
- AdaMoLE: Adaptive Mixture of LoRA Experts☆36Updated 11 months ago
- VHTest☆14Updated 10 months ago
- ☆17Updated last year
- [ICLR 2025] Official Implementation of Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection☆46Updated last month
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆43Updated 2 months ago
- [NeurIPS 2024 Spotlight] Code for the paper "Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts"☆62Updated 3 months ago
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training☆12Updated 7 months ago
- 【NeurIPS 2024】Official implementation of "Visual Fourier Prompt Tuning"☆34Updated 8 months ago
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"☆15Updated last year
- ☆10Updated 6 months ago
- CLIP-MoE: Mixture of Experts for CLIP☆46Updated 11 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆74Updated 7 months ago
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆148Updated last year
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆29Updated 6 months ago
- Code for the paper Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution☆57Updated last year
- More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆56Updated 3 months ago
- The efficient tuning method for VLMs☆79Updated last year
- ☆142Updated 9 months ago
- Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models☆36Updated 5 months ago
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆53Updated 5 months ago
- Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images☆15Updated 3 months ago
- Code for ICML 2024 paper (Oral) — Test-Time Model Adaptation with Only Forward Passes☆87Updated last year
- ICCV 2025: Official Implematation of "Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced L…☆36Updated 3 weeks ago
- ☆28Updated last year
- Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.☆28Updated last month