Zhiyuan-Li-John / MuCR
MuCR is a benchmark designed to evaluate the ability of Multimodal Large Language Models (MLLMs) to discern causal links across modalities.
☆14 · Updated last month
Alternatives and similar repositories for MuCR:
Users who are interested in MuCR are comparing it to the repositories listed below.
- [CVPR 2024] Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification ☆28 · Updated last year
- [CVPR 2024] Open-Set Domain Adaptation for Semantic Segmentation ☆36 · Updated 7 months ago
- [COLING'25] HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding ☆38 · Updated 3 months ago
- [ICLR 2024 Oral] Less is More: Fewer Interpretable Region via Submodular Subset Selection ☆78 · Updated 5 months ago
- ☆10 · Updated this week
- [NeurIPS 2024 Spotlight] Code for the paper "Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts" ☆34 · Updated 4 months ago
- Domain Generalization through Distilling CLIP with Language Guidance ☆28 · Updated last year
- The efficient tuning method for VLMs ☆81 · Updated last year
- ☆21 · Updated 7 months ago
- ID-like Prompt Learning for Few-Shot Out-of-Distribution Detection ☆22 · Updated 10 months ago
- [IJCV2025] https://arxiv.org/abs/2304.04521 ☆11 · Updated last month
- [ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models ☆67 · Updated 9 months ago
- PyTorch implementation of the CVPR 2024 paper "Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation" ☆36 · Updated 3 weeks ago
- ☆13 · Updated 4 months ago
- This repository contains the code for AdaCLIP, a computation and latency-aware system for pragmatic multimodal video retrieval. ☆10 · Updated 9 months ago
- Benchmarking Generalized Out-of-Distribution Detection with Vision-Language Models ☆21 · Updated 2 months ago
- Code for the paper "Visual Explanations of Image–Text Representations via Multi-Modal Information Bottleneck Attribution" ☆47 · Updated 11 months ago
- Test-time adaptation via Nearest neighbor information (TAST), submitted to ICLR'23 ☆22 · Updated last year
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024 ☆32 · Updated last year
- Code for "Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models" ☆26 · Updated 4 months ago
- Official implementation of NeurIPS 2024 "Visual Fourier Prompt Tuning" ☆27 · Updated 2 months ago
- Official implementation of "Towards Distribution-Agnostic Generalized Category Discovery" (NIPS 2023) ☆25 · Updated last year
- ☆20 · Updated 10 months ago
- [CVPR 2024] TEA: Test-time Energy Adaptation ☆60 · Updated last year
- [ACLW'24] LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition ☆50 · Updated 7 months ago