BatsResearch / fudd
Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification
☆11Updated last year
Alternatives and similar repositories for fudd:
Users that are interested in fudd are comparing it to the libraries listed below
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆15Updated 11 months ago
- Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts…☆56Updated last year
- Learning to compose soft prompts for compositional zero-shot learning.☆88Updated last year
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.☆48Updated 4 months ago
- [NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!!☆41Updated last week
- ☆26Updated last year
- ☆21Updated 9 months ago
- Code and results accompanying our paper titled CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets☆56Updated last year
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆33Updated last year
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆13Updated last year
- Official Implementation of LADS (Latent Augmentation using Domain descriptionS)☆49Updated last year
- code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720☆55Updated 9 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆38Updated last year
- Official implementation for CVPR'23 paper "BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning"☆110Updated last year
- Compress conventional Vision-Language Pre-training data☆49Updated last year
- [CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding☆45Updated 7 months ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆32Updated last year
- This repository houses the code for the paper - "The Neglected of VLMs"☆28Updated 3 months ago
- [NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization☆103Updated last year
- ☆57Updated last year
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆10Updated last year
- [CVPR 2023] Improving Zero-shot Generalization and Robustness of Multi-modal Models☆31Updated last year
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆35Updated last year
- [NeurIPS 2024] Code for Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models☆35Updated this week
- ☆22Updated 9 months ago
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆40Updated last year
- Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision☆34Updated 4 months ago
- Dataset accompanying the paper "Adaptive Methods for Real-World Domain Generalization"☆15Updated last year
- Official implementation for NeurIPS'23 paper "Geodesic Multi-Modal Mixup for Robust Fine-Tuning"☆32Updated 5 months ago