tonychenxyz / vit-interpret
Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"
☆13Updated 11 months ago
Alternatives and similar repositories for vit-interpret
Users that are interested in vit-interpret are comparing it to the libraries listed below
Sorting:
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆20Updated 5 months ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆20Updated last year
- Official Repository of Personalized Visual Instruct Tuning☆28Updated 2 months ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated last year
- [CVPR 2024 Highlight] ImageNet-D☆43Updated 7 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆39Updated 4 months ago
- Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Fee…☆26Updated last year
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆39Updated 2 months ago
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation☆37Updated 3 months ago
- ☆59Updated last year
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆32Updated last year
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆30Updated 3 weeks ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆52Updated last week
- ☆11Updated 7 months ago
- ☆32Updated last year
- Training code for CLIP-FlanT5☆26Updated 9 months ago
- ☆17Updated 6 months ago
- [CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models☆23Updated 3 months ago
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)☆30Updated 2 weeks ago
- [TIP] Exploring Effective Factors for Improving Visual In-Context Learning☆19Updated 7 months ago
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆24Updated 5 months ago
- [ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recogniti…☆19Updated 2 years ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆39Updated 5 months ago
- Compress conventional Vision-Language Pre-training data☆51Updated last year
- ☆24Updated 11 months ago
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.☆30Updated last year
- ☆41Updated 6 months ago
- Official Implementation for PlugIn Inversion☆16Updated 3 years ago
- ☆38Updated last year