zhegan27 / LXMERT-AdvTrain
Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT adversarial training part
☆21 · Oct 20, 2020 · Updated 5 years ago
Alternatives and similar repositories for LXMERT-AdvTrain
Users who are interested in LXMERT-AdvTrain are comparing it to the libraries listed below.
- Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": UNITER… ☆119 · Jan 13, 2021 · Updated 5 years ago
- PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers" ☆50 · Aug 27, 2021 · Updated 4 years ago
- PyTorch Implementation of VQA Baseline & Hierarchical Co-Attention model ☆16 · Oct 3, 2023 · Updated 2 years ago
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection" ☆10 · May 5, 2020 · Updated 5 years ago
- Implementation of Mutan+ArticleNet on OKVQA ☆10 · Jan 11, 2021 · Updated 5 years ago
- A PyTorch implementation of a data augmentation method for visual question answering ☆21 · May 25, 2023 · Updated 2 years ago
- Source code and pre-trained/fine-tuned checkpoint for NAACL 2021 paper LightningDOT ☆72 · Nov 14, 2022 · Updated 3 years ago
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers". ☆966 · Oct 22, 2022 · Updated 3 years ago
- Code for GAIIC-Track1 ☆15 · Jun 14, 2022 · Updated 3 years ago
- ☆14 · May 10, 2021 · Updated 4 years ago
- Dataset for Bilingual VLN ☆11 · Dec 5, 2020 · Updated 5 years ago
- A Large-Scale Dataset for Paraphrased Reading Comprehension ☆15 · Jul 16, 2023 · Updated 2 years ago
- Data of ACL 2019 Paper "Expressing Visual Relationships via Language". ☆62 · Sep 30, 2020 · Updated 5 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021) ☆18 · May 10, 2023 · Updated 2 years ago
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models" ☆35 · Dec 5, 2022 · Updated 3 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning" ☆800 · Jun 30, 2021 · Updated 4 years ago
- [Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph ☆72 · Feb 9, 2024 · Updated 2 years ago
- Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671 ☆18 · May 6, 2021 · Updated 4 years ago
- Implementation of "Visualize Before You Write: Imagination-Guided Open-Ended Text Generation". ☆17 · Feb 3, 2023 · Updated 3 years ago
- Official repo for "Imagination-Augmented Natural Language Understanding", NAACL 2022. ☆17 · Aug 30, 2022 · Updated 3 years ago
- A self-evident application of the VQA task is to design systems that aid blind people with sight reliant queries. The VizWiz VQA dataset … ☆15 · Dec 12, 2023 · Updated 2 years ago
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language" ☆540 · May 1, 2023 · Updated 2 years ago
- [TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La… ☆114 · Mar 24, 2022 · Updated 3 years ago
- ☆16 · Feb 28, 2023 · Updated 2 years ago
- Multi-faceted Video Moment Localizer ☆17 · Jun 19, 2020 · Updated 5 years ago
- Shows visual grounding methods can be right for the wrong reasons! (ACL 2020) ☆23 · Jun 26, 2020 · Updated 5 years ago
- [EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction ☆51 · Aug 20, 2022 · Updated 3 years ago
- Code for the paper "Inoculation by Fine-Tuning: A Method for Analyzing Challenge Datasets", to be presented at NAACL 2019. ☆20 · Apr 4, 2019 · Updated 6 years ago
- The dataset and code for the paper "Cross-Modal Commentator: Automatic Machine Commenting Based on Cross-Modal Information" ☆19 · Oct 28, 2019 · Updated 6 years ago
- Code for "Neural Speed Reading with Structural-Jump-LSTM" ICLR 2019 ☆25 · Feb 22, 2019 · Updated 6 years ago
- ☆22 · Aug 10, 2020 · Updated 5 years ago
- Pre-trained V+L Data Preparation ☆46 · Jun 2, 2020 · Updated 5 years ago
- Linguistically-Informed Self-Attention for Semantic Role Labeling (old version) ☆27 · Dec 3, 2021 · Updated 4 years ago
- Grid features pre-training code for visual question answering ☆269 · Sep 17, 2021 · Updated 4 years ago
- Official code for SeMani (CVPR 2020 oral and Journal extension) ☆24 · Dec 4, 2023 · Updated 2 years ago
- Instruction to data diversification ☆24 · Nov 24, 2020 · Updated 5 years ago
- MFURLN relationship detection method ☆21 · May 17, 2020 · Updated 5 years ago
- Medical Visual Question Answering via Conditional Reasoning [ACM MM 2020] ☆63 · Aug 20, 2021 · Updated 4 years ago
- ☆27 · Oct 7, 2021 · Updated 4 years ago