yossigandelsman / second_order_lens
Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"
☆39Updated 5 months ago
Alternatives and similar repositories for second_order_lens:
Users that are interested in second_order_lens are comparing it to the libraries listed below
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆33Updated last year
- ☆53Updated 6 months ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…☆57Updated last year
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆45Updated last year
- source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models"☆68Updated 2 weeks ago
- ☆24Updated last year
- Compress conventional Vision-Language Pre-training data☆49Updated last year
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆36Updated last year
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models☆76Updated 7 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆72Updated 10 months ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆11Updated 8 months ago
- Augmenting with Language-guided Image Augmentation (ALIA)☆76Updated last year
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25)☆67Updated 2 months ago
- [NeurIPS 2023] A faithful benchmark for vision-language compositionality☆79Updated last year
- Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation☆31Updated last year
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆38Updated last month
- ☆16Updated 4 months ago
- Code for Debiasing Vision-Language Models via Biased Prompts☆57Updated last year
- [ICLR 2025] VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning☆54Updated 3 months ago
- ☆22Updated 11 months ago
- https://arxiv.org/abs/2209.15162☆49Updated 2 years ago
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation☆36Updated 3 months ago
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆28Updated 5 months ago
- LCA-on-the-line (ICML 2024 Oral)☆11Updated 2 months ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆32Updated last year
- What do we learn from inverting CLIP models?☆54Updated last year
- Official PyTorch Implementation of "Rosetta Neurons: Mining the Common Units in a Model Zoo"☆30Updated last year
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆65Updated 11 months ago
- [CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models☆23Updated 2 months ago
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆83Updated last year