ssfgunner / IIS
[ICLR 2025 Spotlight] This is the official repository for our paper: "Enhancing Pre-trained Representation Classifiability can Boost its Interpretability".
☆25Updated 9 months ago
Alternatives and similar repositories for IIS
Users who are interested in IIS are comparing it to the libraries listed below.
- ☆80Updated 3 months ago
- [CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval…☆31Updated 9 months ago
- This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin…☆91Updated last year
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆378Updated last year
- ☆175Updated 2 years ago
- ☆56Updated last year
- Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022)☆207Updated 3 years ago
- Up-to-date curated list of state-of-the-art research work, papers & resources on hallucinations in large vision-language models☆265Updated this week
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆101Updated last year
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆749Updated 2 months ago
- [WACV 2025] Code for Enhancing Vision-Language Few-Shot Adaptation with Negative Learning☆11Updated 11 months ago
- [ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"☆107Updated last year
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆348Updated last month
- Collection of Composed Image Retrieval (CIR) papers.☆306Updated last month
- [ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…☆284Updated 2 years ago
- Awesome VLM-CL. Continual Learning for VLMs: A Survey and Taxonomy Beyond Forgetting☆149Updated last week
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆34Updated 3 months ago
- PyTorch implementation of "Test-time Adaptation for Cross-modal Retrieval with Query Shift".☆31Updated 2 months ago
- ☆18Updated last year
- A paper list on LLMs and multimodal LLMs☆56Updated 3 weeks ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆77Updated 3 months ago
- ☆27Updated 9 months ago
- [TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”☆46Updated 2 years ago
- ☆157Updated 11 months ago
- This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation …☆509Updated 10 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆340Updated 9 months ago
- [NeurIPS2023] Exploring Diverse In-Context Configurations for Image Captioning☆43Updated last year
- Visualizing the attention of vision-language models☆279Updated 11 months ago
- [AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".☆121Updated last year
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"☆58Updated last year