AI4LIFE-GROUP / SpLiCE
Sparse Linear Concept Embeddings
☆47 · Updated last month
Related projects:
- Source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models" ☆59 · Updated 3 months ago
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs. ☆84 · Updated 5 months ago
- Augmenting with Language-guided Image Augmentation (ALIA) ☆62 · Updated 10 months ago
- Official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition" ☆149 · Updated this week
- Official repository of paper "Subobject-level Image Tokenization" ☆58 · Updated 4 months ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023) ☆31 · Updated last year
- [NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?" ☆160 · Updated 6 months ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models ☆75 · Updated 11 months ago
- Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral) ☆97 · Updated 5 months ago
- Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023 ☆127 · Updated last year
- Official implementation of the paper The Hidden Language of Diffusion Models ☆66 · Updated 7 months ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts… ☆51 · Updated last year
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t… ☆96 · Updated 2 months ago
- Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning ☆113 · Updated last year
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning". ☆32 · Updated 6 months ago
- Compress conventional Vision-Language Pre-training data ☆49 · Updated 11 months ago
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models ☆67 · Updated this week
- Official PyTorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP" ☆25 · Updated last month
- [arXiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning ☆61 · Updated 4 months ago
- Visualizing representations with a diffusion-based conditional generative model. ☆84 · Updated last year
- Code for "Finetune like you pretrain: Improved finetuning of zero-shot vision models" ☆86 · Updated last year
- LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images ☆27 · Updated 9 months ago
- Official PyTorch implementation for "Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels" ☆75 · Updated 8 months ago
- Patching open-vocabulary models by interpolating weights ☆88 · Updated 11 months ago
- Matryoshka Multimodal Models ☆67 · Updated 3 weeks ago
- Create generated datasets and train robust classifiers ☆35 · Updated last year