Stanford-AIMI / RaVL
[NeurIPS 2024] RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models
☆14Updated 2 months ago
Alternatives and similar repositories for RaVL:
Users that are interested in RaVL are comparing it to the libraries listed below
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆47Updated last month
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆25Updated this week
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models☆72Updated 4 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆20Updated last year
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆29Updated 3 months ago
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆38Updated 9 months ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆19Updated 2 months ago
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"☆12Updated 7 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆28Updated 3 months ago
- ☆21Updated 3 months ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆27Updated 10 months ago
- More dimensions = More fun☆21Updated 5 months ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges☆30Updated last year
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆38Updated last year
- "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆15Updated 6 months ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆35Updated 5 months ago
- ☆37Updated 2 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆16Updated 3 weeks ago
- HGRN2: Gated Linear RNNs with State Expansion☆52Updated 5 months ago
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations"☆31Updated last year
- ☆40Updated this week
- [Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla…☆49Updated 3 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation☆27Updated last year
- Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆14Updated last month
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"☆28Updated 3 months ago
- ☆16Updated 3 months ago
- Codebase for adaptive continual memory☆13Updated last year
- ☆31Updated 11 months ago
- Official Repository of Personalized Visual Instruct Tuning☆26Updated 2 months ago