ssfgunner / IISLinks
[ICLR 2025 Spotlight] This is the official repository for our paper: ''Enhancing Pre-trained Representation Classifiability can Boost its Interpretability''.
☆15Updated last month
Alternatives and similar repositories for IIS
Users that are interested in IIS are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] This is the official repository for our paper: ''Expanding Sparse Tuning for Low Memory Usage''.☆15Updated 2 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆74Updated 4 months ago
- ☆17Updated 7 months ago
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆23Updated 6 months ago
- ☆167Updated last year
- ☆64Updated 3 months ago
- [AAAI2024] Official implementation of TGP-T☆28Updated last year
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆167Updated last year
- ☆21Updated 2 years ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"☆26Updated last month
- [NeurIPS2023] Exploring Diverse In-Context Configurations for Image Captioning☆39Updated 6 months ago
- [ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentangl…☆11Updated 2 months ago
- [NeurIPS 2024] Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models☆36Updated 8 months ago
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆37Updated 3 months ago
- This is the official repository for paper: cross-modal information flow in multimodal large language models☆13Updated last month
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆46Updated 2 weeks ago
- official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆215Updated 3 weeks ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆145Updated 3 months ago
- A comprehensive survey of Composed Multi-modal Retrieval (CMR), including Composed Image Retrieval (CIR) and Composed Video Retrieval (CV…☆41Updated 3 weeks ago
- Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models☆29Updated this week
- [CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"☆104Updated 3 weeks ago
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆265Updated last year
- ☆31Updated 9 months ago
- ✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).☆48Updated 2 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆80Updated 5 months ago
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"☆51Updated 9 months ago
- ☆39Updated last year
- Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval☆21Updated 2 months ago
- ☆8Updated 6 months ago
- [ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers☆16Updated last year