mair-lab / maplLinks

☆30

Alternatives and similar repositories for mapl

Users that are interested in mapl are comparing it to the libraries listed below

Sorting:

ajd12342 / why-winoground-hard
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
☆31Updated 2 years ago
McGill-NLP / imagecode
Code and data for ImageCoDe, a contextual vison-and-language benchmark
☆41Updated last year
google-deepmind / svo_probes
The SVO-Probes Dataset for Verb Understanding
☆31Updated 4 years ago
RAIVNLab / sugar-crepe
[NeurIPS 2023] A faithful benchmark for vision-language compositionality
☆89Updated last year
BatsResearch / csp
Learning to compose soft prompts for compositional zero-shot learning.
☆93Updated 4 months ago
facebookresearch / reliable_vqa
Implementation for the paper "Reliable Visual Question Answering Abstain Rather Than Answer Incorrectly" (ECCV 2022: https//arxiv.org/abs…
☆38Updated 2 years ago
allenai / aokvqa
Official repository for the A-OKVQA dataset
☆109Updated last year
ylsung / VL_adapter
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
☆209Updated 3 years ago
allenai / sherlock
Code, data, models for the Sherlock corpus
☆59Updated 3 years ago
e-bug / iglue
[ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"
☆49Updated 3 years ago
maximek3 / e-ViL
☆40Updated 3 years ago
MikeWangWZHL / Paxion
Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight
☆37Updated 2 years ago
ExplainableML / CLEVR-X
CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
☆29Updated 2 years ago
allenai / close
☆59Updated 2 years ago
limanling / KnowledgeVL-Reading
☆67Updated 2 years ago
fuzihaofzh / AnalyzeParameterEfficientFinetune
On the Effectiveness of Parameter-Efficient Fine-Tuning
☆38Updated 2 years ago
Letian2003 / C-VQA
Counterfactual Reasoning VQA Dataset
☆27Updated 2 years ago
sIncerass / MVLPT
code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720
☆56Updated last year
microsoft / PICa
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI 2022 (Oral)
☆87Updated 3 years ago
mertyg / vision-language-models-are-bows
Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …
☆291Updated 2 years ago
vipulgupta1011 / swapmix
☆20Updated 3 years ago
edchengg / infoseek_eval
EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions
☆25Updated last year
zmykevin / UVLP
CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment
☆22Updated 3 years ago
Computer-Vision-in-the-Wild / Elevater_Toolkit_IC
Toolkit for Elevater Benchmark
☆76Updated 2 years ago
eric-ai-lab / CPL
Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"
☆35Updated 3 years ago
edchengg / oven_eval
ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities
☆43Updated 8 months ago
LisaAnne / Hallucination
☆93Updated 6 years ago
lancopku / clip-openness
[ACL 2023] Delving into the Openness of CLIP
☆24Updated 3 years ago
PLUM-Lab / MultiInstruct
MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning
☆134Updated 2 years ago
cdancette / detect-shortcuts
Repo for ICCV 2021 paper: Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering
☆28Updated last year