yossigandelsman/clip_text_span

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yossigandelsman/clip_text_span)

yossigandelsman / clip_text_span

official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"

☆234

Alternatives and similar repositories for clip_text_span

Users that are interested in clip_text_span are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yossigandelsman / second_order_lens
View on GitHub
Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"
☆42Nov 15, 2024Updated last year
samyadeepbasu / LocoGen
View on GitHub
Localization of Knowledge in Text-to-Image Models
☆11Oct 8, 2024Updated last year
tmlr-group / WCA
View on GitHub
[ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models"
☆59Sep 3, 2024Updated last year
ant-research / DreamLIP
View on GitHub
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
☆138May 8, 2025Updated last year
facebookresearch / DCI
View on GitHub
Densely Captioned Images (DCI) dataset repository.
☆197Jul 1, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
AI4LIFE-GROUP / SpLiCE
View on GitHub
Sparse Linear Concept Embeddings
☆133Mar 27, 2025Updated last year
tonychenxyz / vit-interpret
View on GitHub
Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"
☆14May 29, 2024Updated 2 years ago
tsb0601 / MMVP
View on GitHub
☆365Jan 27, 2024Updated 2 years ago
sarahpratt / CuPL
View on GitHub
☆203May 10, 2023Updated 3 years ago
Thunderbeee / ZSCL
View on GitHub
Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models
☆110Mar 5, 2024Updated 2 years ago
wangf3014 / SCLIP
View on GitHub
Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
☆192Jul 16, 2026Updated last week
ylingfeng / FGVP
View on GitHub
Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023
☆57Feb 1, 2024Updated 2 years ago
CVMI-Lab / clip-beyond-tail
View on GitHub
(NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights
☆27Oct 28, 2024Updated last year
Vinoground / Vinoground
View on GitHub
☆13Apr 13, 2026Updated 3 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
UCSC-VLAA / CLIPS
View on GitHub
An Enhanced CLIP Framework for Learning with Synthetic Captions
☆40Apr 18, 2025Updated last year
cwj1412 / MSCOCO-Flikcr30K_FG
View on GitHub
Benchmark data for "Rethinking Benchmarks for Cross-modal Image-text Retrieval" (SIGIR 2023)
☆28Apr 24, 2023Updated 3 years ago
Pter61 / context-i2w
View on GitHub
Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]
☆54May 27, 2025Updated last year
BatsResearch / ex2
View on GitHub
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17Apr 4, 2024Updated 2 years ago
altndrr / vic
View on GitHub
Code implementation of our NeurIPS 2023 paper: Vocabulary-free Image Classification
☆107Feb 2, 2024Updated 2 years ago
alinlab / s-clip
View on GitHub
S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions (NeurIPS 2023)
☆51May 26, 2023Updated 3 years ago
SunzeY / AlphaCLIP
View on GitHub
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
☆876Jul 20, 2025Updated last year
amitakamath / whatsup_vlms
View on GitHub
Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".
☆71Feb 28, 2024Updated 2 years ago
wjpoom / SPEC
View on GitHub
[CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"
☆52Jun 16, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
XYPB / CLEFT
View on GitHub
Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…
☆18Feb 12, 2025Updated last year
yangyangyang127 / APE
View on GitHub
[ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"
☆150Apr 21, 2024Updated 2 years ago
YueYANG1996 / LaBo
View on GitHub
CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification
☆108May 28, 2024Updated 2 years ago
facebookresearch / MetaCLIP
View on GitHub
NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024
☆1,850Nov 27, 2025Updated 8 months ago
mertyg / vision-language-models-are-bows
View on GitHub
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …
☆294Jun 7, 2023Updated 3 years ago
mc-lan / ClearCLIP
View on GitHub
[ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
☆100Mar 26, 2025Updated last year
muzairkhattak / PromptSRC
View on GitHub
[ICCV'23 Main Track, WECIA'23 Oral] Official repository of paper titled "Self-regulating Prompts: Foundational Model Adaptation without F…
☆286Sep 28, 2023Updated 2 years ago
LijieFan / LaCLIP
View on GitHub
[NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"
☆291Jan 14, 2024Updated 2 years ago
microsoft / klite
View on GitHub
[NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222
☆54Jun 12, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
vishaal27 / SuS-X
View on GitHub
Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]
☆104Aug 22, 2023Updated 2 years ago
thuml / CLIPood
View on GitHub
About Code Release for "CLIPood: Generalizing CLIP to Out-of-Distributions" (ICML 2023), https://arxiv.org/abs/2302.00864
☆70Sep 17, 2023Updated 2 years ago
baaivision / CapsFusion
View on GitHub
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
☆215Feb 27, 2024Updated 2 years ago
Annusha / xmic
View on GitHub
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024
☆11Nov 7, 2024Updated last year
beichenzbc / Long-CLIP
View on GitHub
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
☆901Aug 13, 2024Updated last year
ys-zong / MIRB
View on GitHub
Benchmarking Multi-Image Understanding in Vision and Language Models
☆11Jul 29, 2024Updated 2 years ago
arijitray1993 / COLA
View on GitHub
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25May 14, 2026Updated 2 months ago