fawazsammani / clip-interpret-mutual-knowledge
Interpreting and Analyzing CLIP's Zero-Shot Image Classification via Mutual Knowledge, NeurIPS 2024
☆15Updated 4 months ago
Alternatives and similar repositories for clip-interpret-mutual-knowledge
Users who are interested in clip-interpret-mutual-knowledge are comparing it to the repositories listed below
- [ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.☆94Updated last week
- Code for the paper "Compositional Entailment Learning for Hyperbolic Vision-Language Models".☆85Updated 4 months ago
- Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023☆54Updated 2 years ago
- Official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆232Updated 5 months ago
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)☆91Updated last year
- [ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models☆170Updated last year
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆107Updated last year
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆37Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆43Updated last year
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆76Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆122Updated 6 months ago
- Code for paper "Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters" CVPR2024☆253Updated last month
- [CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo…☆50Updated last year
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆58Updated 9 months ago
- Official Implementation of the ECCV 2024 Paper: "CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts"☆54Updated last week
- [AAAI'25, CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".☆114Updated 10 months ago
- [CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images.☆29Updated last year
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆30Updated 7 months ago
- [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation☆64Updated 3 months ago
- NegCLIP.☆37Updated 2 years ago
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆33Updated 7 months ago
- Open source implementation of "Vision Transformers Need Registers"☆197Updated last week
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆84Updated last year
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆110Updated 2 months ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning☆162Updated 3 years ago
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆54Updated last year
- Detail-Oriented CLIP for Fine-Grained Tasks (ICLR SSI-FM 2025)☆55Updated 7 months ago