fawazsammani / clip-interpret-mutual-knowledgeLinks
Interpreting and Analyzing CLIP's Zero-Shot Image Classification via Mutual Knowledge, NeurIPS 2024
☆15Updated 4 months ago
Alternatives and similar repositories for clip-interpret-mutual-knowledge
Users that are interested in clip-interpret-mutual-knowledge are comparing it to the libraries listed below
Sorting:
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆80Updated 11 months ago
- Code for the paper "Compositional Entailment Learning for Hyperbolic Vision-Language Models".☆74Updated last week
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)☆85Updated 8 months ago
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆49Updated 11 months ago
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆45Updated 2 weeks ago
- ☆67Updated 11 months ago
- [CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.☆28Updated last year
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆74Updated last year
- source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models"☆68Updated 2 months ago
- [CVPR'24] Validation-free few-shot adaptation of CLIP, using a well-initialized Linear Probe (ZSLP) and class-adaptive constraints (CLAP)…☆74Updated 2 weeks ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆105Updated last year
- [ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions☆131Updated last month
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆114Updated 2 months ago
- CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts☆51Updated 9 months ago
- [CVPR24] Official Implementation of GEM (Grounding Everything Module)☆126Updated 2 months ago
- ☆41Updated 5 months ago
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆167Updated last year
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)☆44Updated 9 months ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆81Updated last year
- An open source implementation of CLIP (With TULIP Support)☆157Updated last month
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆34Updated last year
- [CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo…☆48Updated 10 months ago
- ☆20Updated 7 months ago
- ☆42Updated last month
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆81Updated 2 months ago
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆50Updated 2 months ago
- Official repository of paper "Subobject-level Image Tokenization" (ICML-25)☆72Updated 2 months ago
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion☆47Updated 2 months ago
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference☆156Updated 8 months ago
- This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…☆20Updated 11 months ago