Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.
☆267Aug 18, 2022Updated 3 years ago
Alternatives and similar repositories for nlvr
Users that are interested in nlvr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Dec 7, 2022Updated 3 years ago
- A (growing) collection of useful abstractions and implementations for research.☆17Jan 20, 2020Updated 6 years ago
- Feature resources of "Diagnosing the Environment Bias in Vision-and-Language Navigation"☆16May 6, 2020Updated 5 years ago
- ☆22Jan 14, 2026Updated 2 months ago
- [TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La…☆115Mar 24, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".☆966Oct 22, 2022Updated 3 years ago
- [EMNLP 2021] Code and data for our paper "Visually Grounded Reasoning across Languages and Cultures"☆30Dec 30, 2021Updated 4 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Jun 12, 2023Updated 2 years ago
- Dataset and starting code for visual entailment dataset☆120Apr 21, 2022Updated 3 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆800Jun 30, 2021Updated 4 years ago
- Code for "Systematic Generalization: What Is Required and Can It Be Learned"☆37Apr 2, 2019Updated 7 years ago
- PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"☆521Dec 8, 2021Updated 4 years ago
- ☆478Nov 21, 2022Updated 3 years ago
- ☆11May 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283☆166Mar 1, 2017Updated 9 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.☆766Mar 10, 2024Updated 2 years ago
- Translating neuralese☆46Apr 26, 2017Updated 8 years ago
- ☆90Apr 15, 2022Updated 3 years ago
- Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018☆71Nov 17, 2019Updated 6 years ago
- Code accompanying paper "Fine-Grained Visual Entailment" [ECCV 2022].☆11Oct 31, 2022Updated 3 years ago
- ☆1,218May 13, 2024Updated last year
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆45Nov 29, 2023Updated 2 years ago
- ☆24Dec 22, 2016Updated 9 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Dynamic Robot Instruction Following☆40Dec 28, 2021Updated 4 years ago
- MLPs for Vision and Langauge Modeling (Coming Soon)☆27Dec 9, 2021Updated 4 years ago
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"☆541May 1, 2023Updated 2 years ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆1,467Feb 3, 2023Updated 3 years ago
- Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017☆272Jul 30, 2020Updated 5 years ago
- [EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering☆182Oct 25, 2022Updated 3 years ago
- Data repository for the VALSE benchmark.☆38Feb 15, 2024Updated 2 years ago
- ☆178Jul 31, 2020Updated 5 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022☆268Oct 2, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- MAttNet: Modular Attention Network for Referring Expression Comprehension☆298Nov 29, 2022Updated 3 years ago
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- Inferring and Executing Programs for Visual Reasoning☆805Aug 30, 2021Updated 4 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆645Aug 30, 2021Updated 4 years ago
- [EMNLP 2017] Code for "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog"☆95May 5, 2020Updated 5 years ago
- Demonstration code for Liang and Potts 2015☆79Aug 27, 2016Updated 9 years ago
- PyTorch code for EMNLP 2020 paper "X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers"☆50Aug 27, 2021Updated 4 years ago