ssfgunner / IISLinks
[ICLR 2025 Spotlight] This is the official repository for our paper: ''Enhancing Pre-trained Representation Classifiability can Boost its Interpretability''.
☆22Updated 6 months ago
Alternatives and similar repositories for IIS
Users that are interested in IIS are comparing it to the libraries listed below
Sorting:
- ☆77Updated 3 weeks ago
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆339Updated last year
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆73Updated 2 weeks ago
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆703Updated this week
- Collection of Composed Image Retrieval (CIR) papers.☆274Updated 2 weeks ago
- [NeurIPS 2024] Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models☆39Updated last year
- This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation …☆502Updated 8 months ago
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆280Updated 2 years ago
- Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022))☆199Updated 3 years ago
- up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources☆208Updated last month
- This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin…☆87Updated last year
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆339Updated 2 months ago
- ☆175Updated last year
- The official GitHub page for the survey paper "CLIP-Powered Domain Generalization and Domain Adaptation: A Comprehensive Survey". And thi…☆55Updated this week
- [CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".☆786Updated 2 years ago
- [NeurIPS2023] Exploring Diverse In-Context Configurations for Image Captioning☆42Updated 11 months ago
- [CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval…☆26Updated 7 months ago
- ☆53Updated 11 months ago
- 关于LLM和Multimodal LLM的paper list☆50Updated last month
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆101Updated 11 months ago
- Uncertainty-Guided Noisy Correspondence Learning for Efficient Cross-Modal Matching (ACM SIGIR 2024, Pytorch Code)☆24Updated 9 months ago
- A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.☆410Updated last year
- Awsome of VLM-CL. Continual Learning for VLMs: A Survey and Taxonomy Beyond Forgetting☆111Updated last week
- A comprehensive survey of Composed Multi-modal Retrieval (CMR), including Composed Image Retrieval (CIR) and Composed Video Retrieval (CV…☆68Updated 3 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)☆73Updated 9 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆293Updated 7 months ago
- The code of the paper "Negative Pre-aware for Noisy Cross-modal Matching" in AAAI 2024.☆23Updated 4 months ago
- [NeurIPS 2023] Generalized Logit Adjustment☆39Updated last year
- Awesome papers & datasets specifically focused on long-term videos.☆328Updated last month
- ☆145Updated 11 months ago