IntelLabs / VL-InterpreTLinks

Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers

☆97

Alternatives and similar repositories for VL-InterpreT

Users that are interested in VL-InterpreT are comparing it to the libraries listed below

Sorting:

goel-shashank / CyCLIP
☆120Updated 2 years ago
microsoft / FIBER
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
☆130Updated 2 years ago
facebookresearch / diht
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
☆138Updated 2 years ago
ylsung / VL_adapter
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
☆207Updated 2 years ago
mlfoundations / patching
Patching open-vocabulary models by interpolating weights
☆91Updated 2 years ago
McGill-NLP / imagecode
Code and data for ImageCoDe, a contextual vison-and-language benchmark
☆41Updated last year
allenai / unified-io-inference
☆229Updated last year
facebookresearch / OTTER
This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …
☆69Updated 3 years ago
yuhui-zh15 / drml
Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)
☆34Updated 2 years ago
fawazsammani / nlxgpt
NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)
☆48Updated last year
mbanani / lgssl
[CVPR 2023] Learning Visual Representations via Language-Guided Sampling
☆149Updated 2 years ago
ajd12342 / why-winoground-hard
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
☆31Updated 2 years ago
RAIVNLab / sugar-crepe
[NeurIPS 2023] A faithful benchmark for vision-language compositionality
☆88Updated last year
facebookresearch / CiT
Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".
☆78Updated 2 years ago
igorbrigadir / DownloadConceptualCaptions
Reliably download millions of images efficiently
☆118Updated 4 years ago
Computer-Vision-in-the-Wild / Elevater_Toolkit_IC
Toolkit for Elevater Benchmark
☆76Updated 2 years ago
google-deepmind / svo_probes
The SVO-Probes Dataset for Verb Understanding
☆31Updated 3 years ago
jmerullo / limber
https://arxiv.org/abs/2209.15162
☆53Updated 2 years ago
LAION-AI / scaling-laws-openclip
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
☆179Updated 5 months ago
cambridgeltl / visual-spatial-reasoning
[TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.
☆133Updated 2 years ago
mlfoundations / clip_quality_not_quantity
☆29Updated 3 years ago
mertyg / vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …
☆286Updated 2 years ago
redcaps-dataset / redcaps-downloader
Command-line tool for downloading and extending the RedCaps dataset.
☆50Updated last year
allenai / sherlock
Code, data, models for the Sherlock corpus
☆58Updated 3 years ago
HendrikStrobelt / miniClip
☆47Updated 6 months ago
allenai / close
☆59Updated 2 years ago
young-geng / m3ae_public
Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation
☆103Updated 8 months ago
Weixin-Liang / MetaShift
MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts (ICLR 2022)
☆109Updated 3 years ago
microsoft / BridgeTower
Open source code for AAAI 2023 Paper "BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning"
☆166Updated 2 years ago
shizhediao / DaVinci
Source code for the paper "Prefix Language Models are Unified Modal Learners"
☆43Updated 2 years ago