yossigandelsman / clip_text_spanView external linksLinks
official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"
☆233Jun 1, 2025Updated 8 months ago
Alternatives and similar repositories for clip_text_span
Users that are interested in clip_text_span are comparing it to the libraries listed below
Sorting:
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"☆42Nov 15, 2024Updated last year
- [ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions☆138May 8, 2025Updated 9 months ago
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆28Oct 28, 2024Updated last year
- Localization of Knowledge in Text-to-Image Models☆12Oct 8, 2024Updated last year
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆39Apr 18, 2025Updated 9 months ago
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"☆58Sep 3, 2024Updated last year
- Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference☆180Oct 10, 2024Updated last year
- Densely Captioned Images (DCI) dataset repository.☆196Jul 1, 2024Updated last year
- ☆360Jan 27, 2024Updated 2 years ago
- Sparse Linear Concept Embeddings☆130Mar 27, 2025Updated 10 months ago
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"☆50Jun 16, 2025Updated 7 months ago
- Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification☆107Feb 2, 2024Updated 2 years ago
- Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models☆107Mar 5, 2024Updated last year
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆70Feb 28, 2024Updated last year
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…☆94Apr 29, 2024Updated last year
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆213Feb 27, 2024Updated last year
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆56Feb 1, 2024Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆56May 27, 2025Updated 8 months ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated last year
- Official implementation of "Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models"☆14Mar 19, 2025Updated 10 months ago
- [ICCV2025] Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning☆23Nov 13, 2025Updated 3 months ago
- [CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want☆866Jul 20, 2025Updated 6 months ago
- ☆200May 10, 2023Updated 2 years ago
- CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification☆105May 28, 2024Updated last year
- Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…☆17Feb 12, 2025Updated last year
- Benchmark data for "Rethinking Benchmarks for Cross-modal Image-text Retrieval" (SIGIR 2023)☆27Apr 24, 2023Updated 2 years ago
- Python package to download and use the SSB datasets☆11Aug 3, 2023Updated 2 years ago
- [MICCAI‘25 Early Accept] MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment☆15Nov 15, 2025Updated 3 months ago
- ☆175Dec 29, 2023Updated 2 years ago
- Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …☆292Jun 7, 2023Updated 2 years ago
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆106Aug 22, 2023Updated 2 years ago
- S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions☆50May 26, 2023Updated 2 years ago
- [ACL 2023] Delving into the Openness of CLIP☆24Jan 11, 2023Updated 3 years ago
- [NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"☆288Jan 14, 2024Updated 2 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- Benchmarking Multi-Image Understanding in Vision and Language Models☆12Jul 29, 2024Updated last year
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆14Sep 30, 2023Updated 2 years ago