jackroos / VL-BERT
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
☆741Updated last year
Alternatives and similar repositories for VL-BERT:
Users that are interested in VL-BERT are comparing it to the libraries listed below
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".☆942Updated 2 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆787Updated 3 years ago
- ☆473Updated 2 years ago
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"☆531Updated last year
- Multi Task Vision and Language☆804Updated 2 years ago
- Oscar and VinVL☆1,039Updated last year
- Deep Modular Co-Attention Networks for Visual Question Answering☆448Updated 4 years ago
- Vision-Language Pre-training for Image Captioning and Question Answering☆417Updated 3 years ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆713Updated last year
- BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models☆348Updated 5 years ago
- Recent Advances in Vision and Language PreTrained Models (VL-PTMs)