orrzohar / LOVM
[NeurIPS 2023] Official PyTorch code for LOVM: Language-Only Vision Model Selection
☆18 Updated 7 months ago
Related projects:
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023) ☆31 Updated last year
- Create generated datasets and train robust classifiers ☆35 Updated last year
- Compress conventional Vision-Language Pre-training data ☆49 Updated 11 months ago
- This repo implements the CVPR23 paper Trainable Projected Gradient Method for Robust Fine-tuning ☆21 Updated 9 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt… ☆26 Updated 2 months ago
- Code and instructions accompanying ICCV'23 paper Prototype-based Dataset Comparison ☆17 Updated 9 months ago
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'. ☆11 Updated 7 months ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆22 Updated 3 months ago
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024 ☆11 Updated 2 months ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 Updated last year
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts… ☆51 Updated last year
- [CVPR 2023] Improving Zero-shot Generalization and Robustness of Multi-modal Models ☆29 Updated last year
- Official code for "Disentangling Visual Embeddings for Attributes and Objects", published at CVPR 2022 ☆32 Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally? ☆31 Updated last year
- Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis" ☆13 Updated last year
- LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images ☆27 Updated 9 months ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval". ICCV 2023 ☆11 Updated 11 months ago
- Repo for the paper "Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment" (AAAI'24 Oral) ☆25 Updated 4 months ago
- NegCLIP. ☆23 Updated last year
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23) ☆33 Updated 8 months ago
- ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations wh… ☆25 Updated 2 years ago
- Augmenting with Language-guided Image Augmentation (ALIA) ☆62 Updated 10 months ago
- Official Implementation of LADS (Latent Augmentation using Domain descriptionS) ☆49 Updated last year
- ImageNetV2 PyTorch Dataset ☆36 Updated last year
- Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation ☆43 Updated last year
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me… ☆13 Updated last year