Pual2013 / FRPTLinks

Fine-grained Retrieval Prompt Tuning

☆3

Alternatives and similar repositories for FRPT

Users that are interested in FRPT are comparing it to the libraries listed below

Sorting:

LooperXX / ManagerTower
Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
☆11Updated 7 months ago
michelecafagna26 / VinVL
Original VinVL (and Oscar) repo with API designed for an easy inference
☆8Updated 2 years ago
BierOne / Attention-Faithfulness
[ICML 2022] This is the pytorch implementation of "Rethinking Attention-Model Explainability through Faithfulness Violation Test" (https:…
☆19Updated 3 years ago
zmykevin / UVLP
CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment
☆22Updated 3 years ago
facebookresearch / reliable_vqa
Implementation for the paper "Reliable Visual Question Answering Abstain Rather Than Answer Incorrectly" (ECCV 2022: https//arxiv.org/abs…
☆35Updated 2 years ago
zhjohnchan / bert-clip-synesthesia
[Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.
☆14Updated 2 years ago
eric-ai-lab / CPL
Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"
☆34Updated 2 years ago
JiwanChung / vlis
☆24Updated last year
zinengtang / VidLanKD
Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))
☆56Updated 2 years ago
sIncerass / MVLPT
code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720
☆56Updated last year
prdwb / okvqa-release
☆14Updated 4 years ago
naver-ai / eccv-caption
Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)
☆55Updated last year
lancopku / IAIS
[ACL 2021] Learning Relation Alignment for Calibrated Cross-modal Retrieval
☆31Updated 2 years ago
mlfoundations / clip_quality_not_quantity
☆29Updated 2 years ago
google-research / fnc
☆28Updated 3 years ago
woojeongjin / FewVLM
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)
☆42Updated 3 years ago
fawazsammani / nlxgpt
NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)
☆48Updated last year
easonnie / mlp-vil
MLPs for Vision and Langauge Modeling (Coming Soon)
☆27Updated 3 years ago
kugwzk / DiDE
Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding”
☆30Updated 2 years ago
e-bug / iglue
[ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"
☆49Updated 2 years ago
YuanEZhou / CBTrans
☆22Updated 3 years ago
zmykevin / UC2
CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
☆34Updated 3 years ago
YehLi / TDEN
☆9Updated 2 years ago
zinengtang / Perceiver_VL
PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)
☆33Updated 2 years ago
Victorwz / VaLM
VaLM: Visually-augmented Language Modeling. ICLR 2023.
☆56Updated 2 years ago
RitaRamo / extra
Retrieval-augmented Image Captioning
☆13Updated 2 years ago
e-bug / cross-modal-ablation
[EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…
☆20Updated 3 years ago
yangxuntu / catt
☆12Updated 4 years ago
Sreyan88 / ACLM
Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER
☆20Updated 2 years ago
naver-ai / pcmepp
Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)
☆57Updated last year