hyzhang98 / PiNI
Enhance Vision-Language Alignment with Noise (AAAI 2025)
β19Updated 4 months ago
Alternatives and similar repositories for PiNI:
Users that are interested in PiNI are comparing it to the libraries listed below
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoningβ21Updated 7 months ago
- πOfficial code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".β32Updated last month
- [CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"β33Updated last week
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024β32Updated last year
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024β42Updated 9 months ago
- β45Updated last year
- [IJCV2025] https://arxiv.org/abs/2304.04521β14Updated 3 months ago
- [ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuningβ49Updated 11 months ago
- Official Repository for CVPR 2024 Paper: "Large Language Models are Good Prompt Learners for Low-Shot Image Classification"β35Updated 9 months ago
- β23Updated last year
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentationβ41Updated this week
- β14Updated last year
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". β¦β52Updated 5 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Modelsβ72Updated 9 months ago
- Code for dmrnetβ22Updated 2 months ago
- The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)β19Updated 5 months ago
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Modelsβ17Updated 9 months ago
- [ICLR 2025] Official Implementation of Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detectionβ22Updated 2 weeks ago
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]β35Updated last month
- [NeurIPS 2023] Meta-Adapterβ48Updated last year
- β34Updated last year
- β18Updated 5 months ago
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)β31Updated 10 months ago
- Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)β69Updated 2 months ago
- Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)β36Updated 9 months ago
- Towards a Unified View on Visual Parameter-Efficient Transfer Learningβ26Updated 2 years ago
- β22Updated 8 months ago
- β19Updated 6 months ago
- [AAAI2024] Official implementation of TGP-Tβ28Updated last year
- The official implementation of the ECCV'24 paper MC-CoT: Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models wβ¦β23Updated 11 months ago