facebookresearch / clevr-dataset-genView external linksLinks
A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
☆643Aug 30, 2021Updated 4 years ago
Alternatives and similar repositories for clevr-dataset-gen
Users that are interested in clevr-dataset-gen are comparing it to the libraries listed below
Sorting:
- Inferring and Executing Programs for Visual Reasoning☆801Aug 30, 2021Updated 4 years ago
- Neural-symbolic visual question answering☆280Mar 27, 2023Updated 2 years ago
- CLEVR-Robot: a reinforcement learning environment combining vision, language and control.☆138Aug 4, 2024Updated last year
- Multi-object image datasets with ground-truth segmentation masks and generative factors.☆281Dec 17, 2021Updated 4 years ago
- Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)☆512Jul 10, 2021Updated 4 years ago
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog☆49Feb 18, 2020Updated 5 years ago
- CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning☆108Dec 18, 2020Updated 5 years ago
- Pytorch implementation of "A simple neural network module for relational reasoning" (Relational Networks)☆818Dec 6, 2022Updated 3 years ago
- Train embodied agents that can answer questions in environments☆316Jul 25, 2023Updated 2 years ago
- Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"☆126Feb 11, 2020Updated 6 years ago
- PyTorch implementation for the Neuro-Symbolic Concept Learner (NS-CL).☆450Oct 24, 2020Updated 5 years ago
- PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning"☆347Dec 7, 2021Updated 4 years ago
- Dataset generator for the realistic blocksworld environment☆24Sep 14, 2022Updated 3 years ago
- Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017☆272Jul 30, 2020Updated 5 years ago
- Visual Question Answering in Pytorch☆734Dec 11, 2019Updated 6 years ago
- PHYRE is a benchmark for physical reasoning.☆458Jul 8, 2023Updated 2 years ago
- Structured Attentions for Visual Question Answering☆46Mar 4, 2018Updated 7 years ago
- Burgess et al. "MONet: Unsupervised Scene Decomposition and Representation"☆89Dec 3, 2022Updated 3 years ago
- [ACL 2019] Visually Grounded Neural Syntax Acquisition☆90Feb 24, 2024Updated last year
- PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"☆128Nov 6, 2020Updated 5 years ago
- Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17☆163Feb 8, 2019Updated 7 years ago
- ☆42Jan 22, 2024Updated 2 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆26Jan 20, 2022Updated 4 years ago
- "Scene Graph Generation by Iterative Message Passing" code repository☆435Mar 27, 2019Updated 6 years ago
- Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 2018☆1,322Jul 25, 2024Updated last year
- An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.☆765Mar 10, 2024Updated last year
- Visual Question Answering Project with state of the art single Model performance.☆131Jun 18, 2018Updated 7 years ago
- a Realistic and Rich 3D Environment☆1,203Jul 6, 2020Updated 5 years ago
- Neural Module Network for VQA in Pytorch☆107Dec 16, 2017Updated 8 years ago
- RAVEN: A Dataset for Relational and Analogical Visual rEasoNing☆190Apr 12, 2025Updated 10 months ago
- Code for ICML 2019 paper "Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering" [long-oral]☆67Aug 3, 2023Updated 2 years ago
- Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome☆1,465Feb 3, 2023Updated 3 years ago
- Neural Scene De-rendering☆64Oct 23, 2017Updated 8 years ago
- Dataset to assess the disentanglement properties of unsupervised learning methods☆526Jan 3, 2021Updated 5 years ago
- disentanglement_lib is an open-source library for research on learning disentangled representations.☆1,420May 16, 2021Updated 4 years ago
- Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018☆71Nov 17, 2019Updated 6 years ago
- A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)☆5,616Jan 12, 2026Updated last month
- An implementation of the MONet model for unsupervised scene decomposition in PyTorch☆59May 16, 2022Updated 3 years ago
- This repository provides code for reproducing experiments of the paper Talk The Walk: Navigating New York City Through Grounded Dialogue …☆110Aug 12, 2021Updated 4 years ago