kexinyi / ns-vqa
Neural-symbolic visual question answering
☆263Updated last year
Alternatives and similar repositories for ns-vqa:
Users that are interested in ns-vqa are comparing it to the libraries listed below
- PyTorch implementation for the Neuro-Symbolic Concept Learner (NS-CL).☆422Updated 4 years ago
- Neural State Machine implemented in PyTorch☆71Updated 5 years ago
- PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"☆115Updated 4 years ago
- A pytorch implementation for "A simple neural network module for relational reasoning", working on the CLEVR dataset☆88Updated 5 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆607Updated 3 years ago
- Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero and few-shot generalization.☆133Updated last year
- Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)☆465Updated 3 years ago
- Recent Papers including Neural Symbolic Reasoning, Logical Reasoning, Visual Reasoning, planning and any other topics connecting deep lea…☆309Updated 2 years ago
- Grounded SCAN data set.☆69Updated 3 years ago
- PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation☆121Updated last year
- Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog☆46Updated 5 years ago
- Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)☆501Updated 3 years ago
- PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"☆127Updated 3 years ago
- Official code repo for "ProTo: program-guided Transformers for Program-guided Tasks☆20Updated 2 years ago
- Grid features pre-training code for visual question answering☆269Updated 3 years ago
- Multi-object image datasets with ground-truth segmentation masks and generative factors.☆266Updated 3 years ago
- CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning☆104Updated 4 years ago
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…☆90Updated last year
- Scene Graphs with Permutation-Invariant Structured Prediction☆72Updated 2 years ago
- Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.☆132Updated 2 years ago
- Pytorch implementation of "Explainable and Explicit Visual Reasoning over Scene Graphs "☆94Updated 6 years ago
- Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering☆150Updated 6 years ago
- Bongard-LOGO is a Python code repository with the purpose of generating synthetic Bongard problems on a large scale with little human int…☆51Updated 2 years ago
- Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379☆96Updated 4 years ago
- Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"☆123Updated 5 years ago
- MERLOT: Multimodal Neural Script Knowledge Models☆223Updated 3 years ago
- Memory, Attention and Composition (MAC) Network for CLEVR implemented in PyTorch☆85Updated 6 years ago
- Personal python toolbox.☆140Updated last week
- Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)☆29Updated 3 years ago
- Contrastive Learning of Structured World Models☆391Updated 4 years ago