UCSB-AI/CPL

Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"

☆35

Alternatives and similar repositories for CPL

Users who are interested in CPL are comparing it to the libraries listed below.

allenai / x-lxmert
PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"
☆50 · Aug 27, 2021 · Updated 4 years ago
jochemloedeman / PGN
Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M…
☆44 · Sep 11, 2024 · Updated last year
deeplearning-wisc / mllmshift-emi
Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"
☆12 · May 27, 2025 · Updated last year
CHENGY12 / PLOT
[ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models
☆177 · Dec 14, 2023 · Updated 2 years ago
YujieLu10 / IACE-NLU
Official repo for "Imagination-Augmented Natural Language Understanding", NAACL 2022.
☆17 · Aug 30, 2022 · Updated 3 years ago
yuhangzang / UPT
☆61 · May 2, 2025 · Updated last year
xu1998hz / SEScore
This repo contains all the code for the SEScore implementation
☆15 · Mar 3, 2025 · Updated last year
UCSB-AI / Discffusion
Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
☆29 · Apr 27, 2024 · Updated 2 years ago
prdwb / okvqa-release
☆15 · May 10, 2021 · Updated 5 years ago
wangzheallen / STL-VQA
The good practice in the VQA system such as pos-tag attention, structured triplet learning and triplet attention is very general and can be…
☆19 · Jan 23, 2018 · Updated 8 years ago
zaynmi / seada-vqa
A PyTorch implementation of data augmentation method for visual question answering
☆21 · May 25, 2023 · Updated 3 years ago
xu1998hz / SEScore2
☆17 · Mar 3, 2025 · Updated last year
XMUVQA / CapsAtt
Project for Dynamic Capsule Attention
☆12 · Dec 7, 2019 · Updated 6 years ago
BierOne / relation-vqa
Re-implementation for 'R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering'.
☆12 · Mar 13, 2026 · Updated 4 months ago
YuankaiQi / ORIST
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
☆16 · Feb 7, 2022 · Updated 4 years ago
windx0303 / VIST-Challenge-NAACL-2018
Official GitHub repo of the VIST Challenge NAACL 2018
☆17 · Aug 3, 2018 · Updated 7 years ago
iriscxy / VMSMO
Official code and dataset link for "VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles"
☆36 · Jul 30, 2021 · Updated 4 years ago
ylsung / VL_adapter
PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)
☆212 · Dec 18, 2022 · Updated 3 years ago
weixi-feng / TC-Bench
☆27 · Jun 22, 2024 · Updated 2 years ago
UCSB-AI / FedVLN
[ECCV 2022] Official pytorch implementation of the paper "FedVLN: Privacy-preserving Federated Vision-and-Language Navigation"
☆14 · Oct 8, 2022 · Updated 3 years ago
jialuli-luka / EnvEdit
Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)
☆30 · Aug 2, 2022 · Updated 3 years ago
shengyuzhang / DeVLBert
DeVLBert: Learning Deconfounded Visio-Linguistic Representations
☆27 · Nov 27, 2022 · Updated 3 years ago
art2611 / ML-MDA
Person-ReID code for paper introducing the ML-MDA approach.
☆16 · Jun 14, 2023 · Updated 3 years ago
zongshenmu / attention_knowledge_vqa
VQA driven by bottom-up and top-down attention and knowledge
☆14 · Nov 21, 2018 · Updated 7 years ago
jacobswan1 / Video2Commonsense
Video captioning baseline models on Video2Commonsense Dataset.
☆56 · Apr 15, 2021 · Updated 5 years ago
yashkant / concat-vqa
Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021
☆19 · Jul 27, 2021 · Updated 4 years ago
erobic / negative_analysis_of_grounding
Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)
☆23 · Jun 26, 2020 · Updated 6 years ago
yanxinzju / CSS-VQA
Counterfactual Samples Synthesizing for Robust VQA
☆78 · Nov 24, 2022 · Updated 3 years ago
1429904852 / R-GCN
[ACMMM 2022] Learning from Different text-image Pairs: A Relation-enhanced Graph Convolutional Network for Multimodal NER
☆18 · Feb 21, 2023 · Updated 3 years ago
lemon0830 / promptCSE
Code for promptCSE, EMNLP 2022
☆11 · Apr 10, 2023 · Updated 3 years ago
alibabadoufu / dynamic_fusion_reimplementation
Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering
☆17 · Oct 30, 2019 · Updated 6 years ago
ronghanghu / speaker_follower
Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation, in NeurIPS, 2018.
☆138 · Nov 22, 2022 · Updated 3 years ago
emerisly / EDIS
Entity-Driven Image Search over Multimodal Web Content (EMNLP 2023)
☆26 · Dec 2, 2023 · Updated 2 years ago
lichengunc / vist_eval
VIST storytelling evaluation tool
☆21 · Dec 5, 2023 · Updated 2 years ago
YichaoCai1 / CLAP
Official Implementation of the ECCV 2024 Paper: "CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts"
☆56 · Oct 24, 2025 · Updated 9 months ago
VegB / iNLG
Implementation of "Visualize Before You Write: Imagination-Guided Open-Ended Text Generation".
☆17 · Feb 3, 2023 · Updated 3 years ago
bladewaltz1 / ModeCap
Controllable image captioning model with unsupervised modes
☆21 · Apr 14, 2023 · Updated 3 years ago
sIncerass / MVLPT
Code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720
☆57 · Jun 5, 2024 · Updated 2 years ago
Deanplayerljx / tab-vcr
Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671
☆19 · May 6, 2021 · Updated 5 years ago