kexinyi / ns-vqaLinks

Neural-symbolic visual question answering

☆269

Alternatives and similar repositories for ns-vqa

Users that are interested in ns-vqa are comparing it to the libraries listed below

Sorting:

vacancy / NSCL-PyTorch-Release
PyTorch implementation for the Neuro-Symbolic Concept Learner (NS-CL).
☆432Updated 4 years ago
WellyZhang / RAVEN
RAVEN: A Dataset for Relational and Analogical Visual rEasoNing
☆171Updated 3 months ago
ceyzaguirre4 / NSM
Neural State Machine implemented in PyTorch
☆71Updated 5 years ago
stanfordnlp / mac-network
Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)
☆501Updated 4 years ago
chuangg / CLEVRER
PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"
☆121Updated 4 years ago
facebookresearch / clevr-dataset-gen
A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
☆623Updated 3 years ago
rowanz / r2c
Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)
☆469Updated 4 years ago
LauraRuis / groundedSCAN
Grounded SCAN data set.
☆69Updated 3 years ago
mesnico / RelationNetworks-CLEVR
A pytorch implementation for "A simple neural network module for relational reasoning", working on the CLEVR dataset
☆88Updated 5 years ago
Fen9 / WReN
A Pytorch implementation of "Measuring abstract reasoning in neural networks" in ICML 2018 by DeepMind
☆38Updated 2 years ago
floodsung / Deep-Reasoning-Papers
Recent Papers including Neural Symbolic Reasoning, Logical Reasoning, Visual Reasoning, planning and any other topics connecting deep lea…
☆314Updated 3 years ago
rohitgirdhar / CATER
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
☆105Updated 4 years ago
lil-lab / touchdown
Cornell Touchdown natural language navigation and spatial reasoning dataset.
☆102Updated 4 years ago
google-deepmind / multi_object_datasets
Multi-object image datasets with ground-truth segmentation masks and generative factors.
☆272Updated 3 years ago
rowanz / merlot
MERLOT: Multimodal Neural Script Knowledge Models
☆224Updated 3 years ago
satwikkottur / clevr-dialog
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
☆47Updated 5 years ago
liqing-ustc / NGS
Neural-Grammar-Symbolic Learning with Back-Search
☆54Updated 11 months ago
Cyanogenoid / pytorch-vqa
Strong baseline for visual question answering
☆240Updated 2 years ago
sjtuytc / Neurips21-ProTo-Program-guided-Transformers-for-Program-guided-Tasks
Official code repo for "ProTo: program-guided Transformers for Program-guided Tasks
☆21Updated 3 years ago
ExplorerFreda / VGNSL
[ACL 2019] Visually Grounded Neural Syntax Acquisition
☆90Updated last year
chihyaoma / selfmonitoring-agent
PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
☆122Updated last year
facebookresearch / EmbodiedQA
Train embodied agents that can answer questions in environments
☆307Updated last year
danielgordon10 / thor-iqa-cvpr-2018
Repository containing code for the paper "IQA: Visual Question Answering in Interactive Environments"
☆124Updated 5 years ago
zilongzheng / visdial-gnn
PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations
☆42Updated 4 years ago
Sha-Lab / babywalk
PyTorch code for the ACL 2020 paper: "BabyWalk: Going Farther in Vision-and-Language Navigationby Taking Baby Steps"
☆42Updated 3 years ago
alexa / teach
TEACh is a dataset of human-human interactive dialogues to complete tasks in a simulated household environment.
☆140Updated last year
airsplay / R2R-EnvDrop
PyTorch Code of NAACL 2019 paper "Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout"
☆132Updated 3 years ago
ronghanghu / speaker_follower
Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.
☆133Updated 2 years ago
jialinwu17 / self_critical_vqa
Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''
☆41Updated 5 years ago
alexpashevich / E.T.
Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…
☆90Updated 2 years ago