eric-ai-lab/CPL

Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"

☆35

Alternatives and similar repositories for CPL

Users who are interested in CPL are comparing it to the repositories listed below.

zzxslp / XL-VLN
Dataset for Bilingual VLN
☆11 · Dec 5, 2020 · Updated 5 years ago
allenai / x-lxmert
PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"
☆50 · Aug 27, 2021 · Updated 4 years ago
wangzheallen / STL-VQA
Good practices in VQA systems, such as pos-tag attention, structured triplet learning and triplet attention, are very general and can be…
☆19 · Jan 23, 2018 · Updated 8 years ago
VegB / VLN-Transformer
Implementation of "Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation"
☆27 · Mar 4, 2021 · Updated 5 years ago
deeplearning-wisc / mllmshift-emi
Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"
☆12 · May 27, 2025 · Updated 11 months ago
CHENGY12 / PLOT
[ICLR 2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models
☆176 · Dec 14, 2023 · Updated 2 years ago
zhegan27 / LXMERT-AdvTrain
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…
☆21 · Oct 20, 2020 · Updated 5 years ago
YujieLu10 / IACE-NLU
Official repo for "Imagination-Augmented Natural Language Understanding", NAACL 2022.
☆17 · Aug 30, 2022 · Updated 3 years ago
yuhangzang / UPT
☆61 · May 2, 2025 · Updated last year
xu1998hz / SEScore
This repo contains all the code for the SEScore implementation
☆15 · Mar 3, 2025 · Updated last year
prdwb / okvqa-release
☆15 · May 10, 2021 · Updated 5 years ago
zaynmi / seada-vqa
A PyTorch implementation of a data augmentation method for visual question answering
☆21 · May 25, 2023 · Updated 2 years ago
byM1902 / ViT_visualization
☆12 · May 26, 2022 · Updated 3 years ago
eric-ai-lab / Discffusion
Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
☆29 · Apr 27, 2024 · Updated 2 years ago
xu1998hz / SEScore2
☆17 · Mar 3, 2025 · Updated last year
BierOne / relation-vqa
Re-implementation of "R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering".
☆12 · Mar 13, 2026 · Updated 2 months ago
McGill-NLP / AURORA
Code and data for the paper "Learning Action and Reasoning-Centric Image Editing from Videos and Simulation"
☆35 · Jun 30, 2025 · Updated 10 months ago
windx0303 / VIST-Challenge-NAACL-2018
Official GitHub repo of the VIST Challenge NAACL 2018
☆17 · Aug 3, 2018 · Updated 7 years ago
YuankaiQi / ORIST
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
☆16 · Feb 7, 2022 · Updated 4 years ago
iriscxy / VMSMO
Official code and dataset link for "VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles"
☆36 · Jul 30, 2021 · Updated 4 years ago
ylsung / VL_adapter
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR 2022)
☆211 · Dec 18, 2022 · Updated 3 years ago
eric-ai-lab / MMWorld
Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"
☆28 · Jul 15, 2025 · Updated 10 months ago
jialuli-luka / EnvEdit
PyTorch code and data for "EnvEdit: Environment Editing for Vision-and-Language Navigation" (CVPR 2022)
☆30 · Aug 2, 2022 · Updated 3 years ago
weixi-feng / TC-Bench
☆27 · Jun 22, 2024 · Updated last year
zongshenmu / attention_knowledge_vqa
VQA driven by bottom-up and top-down attention and knowledge
☆14 · Nov 21, 2018 · Updated 7 years ago
erobic / negative_analysis_of_grounding
Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)
☆23 · Jun 26, 2020 · Updated 5 years ago
aws / aws-refcocog-adv
☆22 · Jan 14, 2026 · Updated 4 months ago
jacobswan1 / Video2Commonsense
Video captioning baseline models on the Video2Commonsense dataset.
☆56 · Apr 15, 2021 · Updated 5 years ago
1429904852 / R-GCN
[ACMMM 2022] Learning from Different Text-Image Pairs: A Relation-enhanced Graph Convolutional Network for Multimodal NER
☆18 · Feb 21, 2023 · Updated 3 years ago
yanxinzju / CSS-VQA
Counterfactual Samples Synthesizing for Robust VQA
☆79 · Nov 24, 2022 · Updated 3 years ago
lemon0830 / promptCSE
Code for promptCSE, EMNLP 2022
☆11 · Apr 10, 2023 · Updated 3 years ago
yashkant / concat-vqa
Official code for the paper "Contrast and Classify: Training Robust VQA Models", published at ICCV 2021
☆19 · Jul 27, 2021 · Updated 4 years ago
ronghanghu / speaker_follower
Code release for Fried et al., "Speaker-Follower Models for Vision-and-Language Navigation", NeurIPS 2018.
☆137 · Nov 22, 2022 · Updated 3 years ago
594zyc / HiTUT
Official code for the ACL 2021 Findings paper "Yichi Zhang and Joyce Chai. Hierarchical Task Learning from Language Instructions with Uni…
☆24 · Jun 28, 2021 · Updated 4 years ago
SpencerWhitehead / novelvqa
☆27 · Oct 7, 2021 · Updated 4 years ago
lichengunc / vist_eval
VIST storytelling evaluation tool
☆21 · Dec 5, 2023 · Updated 2 years ago
sergiotasconmorales / consistency_vqa
Repository for the paper "Consistency-preserving Visual Question Answering in Medical Imaging" (MICCAI 2022)
☆26 · Mar 28, 2023 · Updated 3 years ago
bladewaltz1 / ModeCap
Controllable image captioning model with unsupervised modes
☆21 · Apr 14, 2023 · Updated 3 years ago
kuiy / Causal-Autoencoder-CAE-
Source code for "Learning Causal Representations for Robust Domain Adaptation" (IEEE TKDE)
☆12 · Feb 14, 2022 · Updated 4 years ago