Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual input and a sentence describing it, and is annotated with the truth-value of the sentence.
☆269Aug 18, 2022Updated 3 years ago
Alternatives and similar repositories for nlvr
Users that are interested in nlvr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Dec 7, 2022Updated 3 years ago
- A (growing) collection of useful abstractions and implementations for research.☆17Jan 20, 2020Updated 6 years ago
- Feature resources of "Diagnosing the Environment Bias in Vision-and-Language Navigation"☆16May 6, 2020Updated 6 years ago
- ☆22Jan 14, 2026Updated 4 months ago
- [TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-La…☆115Mar 24, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".☆967Oct 22, 2022Updated 3 years ago
- [EMNLP 2021] Code and data for our paper "Visually Grounded Reasoning across Languages and Cultures"☆30Dec 30, 2021Updated 4 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Jun 12, 2023Updated 2 years ago
- Dataset and starting code for visual entailment dataset☆121Apr 21, 2022Updated 4 years ago
- Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"☆800Jun 30, 2021Updated 4 years ago
- Code for "Systematic Generalization: What Is Required and Can It Be Learned"☆37Apr 2, 2019Updated 7 years ago
- PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"☆523Dec 8, 2021Updated 4 years ago
- ☆478Nov 21, 2022Updated 3 years ago
- ☆11May 24, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283☆166Mar 1, 2017Updated 9 years ago
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.☆769Mar 10, 2024Updated 2 years ago
- Translating neuralese☆46Apr 26, 2017Updated 9 years ago
- ☆92Apr 15, 2022Updated 4 years ago
- Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018☆71Nov 17, 2019Updated 6 years ago
- Code accompanying paper "Fine-Grained Visual Entailment" [ECCV 2022].☆11Oct 31, 2022Updated 3 years ago
- ☆1,221May 13, 2024Updated 2 years ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆45Nov 29, 2023Updated 2 years ago
- ☆24Dec 22, 2016Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Dynamic Robot Instruction Following☆42Dec 28, 2021Updated 4 years ago
- MLPs for Vision and Langauge Modeling (Coming Soon)☆27Dec 9, 2021Updated 4 years ago
- Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"☆543May 1, 2023Updated 3 years ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆1,469Feb 3, 2023Updated 3 years ago
- Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017☆272Jul 30, 2020Updated 5 years ago
- [EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering☆181Oct 25, 2022Updated 3 years ago
- Data repository for the VALSE benchmark.☆38Feb 15, 2024Updated 2 years ago
- ☆179Jul 31, 2020Updated 5 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022