Visual Question Reasoning on General Dependency Tree
☆30Updated this week
Alternatives and similar repositories for ACMN-Pytorch
Users that are interested in ACMN-Pytorch are comparing it to the libraries listed below
Sorting:
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆26Jan 20, 2022Updated 4 years ago
- MUREL (CVPR 2019), a multimodal relational reasoning module for VQA☆195Feb 9, 2020Updated 6 years ago
- Event based Sign-Language-Translation☆19Updated this week
- Models and Codes for the paper Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions☆14Aug 6, 2018Updated 7 years ago
- Visually Grounded PCFG Induction☆39May 18, 2022Updated 3 years ago
- PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations☆42Jun 30, 2021Updated 4 years ago
- PyTorch code for BMVC 2019 paper: Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters☆20Jan 4, 2023Updated 3 years ago
- [AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhi…☆27Jul 29, 2024Updated last year
- Texar (tf-backend) implementation of "GTAE: Graph-Transformer Based Auto Encoder for Text Style Transfer"☆45Aug 2, 2024Updated last year
- Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos☆71Sep 7, 2021Updated 4 years ago
- Code for our paper: "Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers".☆20Dec 28, 2021Updated 4 years ago
- PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision☆46Jul 29, 2020Updated 5 years ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Aug 8, 2019Updated 6 years ago
- TensorFlow code for paper: Learning Generative ConvNets via Multi-grid Modeling and Sampling☆25Apr 16, 2018Updated 7 years ago
- Inferring and Executing Programs for Visual Reasoning☆21Jan 4, 2019Updated 7 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Sep 1, 2018Updated 7 years ago
- Pytorch implementation of "Explainable and Explicit Visual Reasoning over Scene Graphs "☆93Mar 17, 2019Updated 6 years ago
- ☆29Mar 24, 2018Updated 7 years ago
- Visual Relation Grounding in Videos (ECCV'20, Spotlight)☆57Dec 8, 2022Updated 3 years ago
- Dual-path Dynamic Enhancement Network for RealSR, IEEE SPL 2020☆25Mar 23, 2020Updated 5 years ago
- [AAAI 2020] (oral) Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing☆62Jul 22, 2020Updated 5 years ago
- The toolbox for the Google Refexp dataset proposed in this paper: http://arxiv.org/abs/1511.02283☆166Mar 1, 2017Updated 9 years ago
- Transfer Learning via Unsupervised Task Discovery for Visual Question Answering☆31Apr 9, 2019Updated 6 years ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆35Apr 16, 2023Updated 2 years ago
- Deep Modular Co-Attention Networks for Visual Question Answering☆457Dec 16, 2020Updated 5 years ago
- Code for the NeurIPS 2019 paper: "Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning"☆33Jun 27, 2023Updated 2 years ago
- The source code of ACL 2020 paper: "Cross-Modality Relevance for Reasoning on Language and Vision"☆27May 6, 2021Updated 4 years ago
- Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018☆71Nov 17, 2019Updated 6 years ago
- Referring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019☆31Apr 21, 2021Updated 4 years ago
- Research code for "Training Vision-Language Transformers from Captions Alone"☆33Jul 15, 2022Updated 3 years ago
- Code for CVPR'18 "Grounding Referring Expressions in Images by Variational Context"☆30Jul 4, 2018Updated 7 years ago
- Graph-Structured Referring Expressions Reasoning in The Wild, In CVPR 2020, Oral.☆116Aug 10, 2020Updated 5 years ago
- ☆32Nov 15, 2020Updated 5 years ago
- Code for CVPR 2019 paper: " Learning Deep Compositional Grammatical Architectures for Visual Recognition"☆131Jun 24, 2019Updated 6 years ago
- Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries, ECCV 2018☆76Sep 21, 2021Updated 4 years ago
- Implementation for CVPR 2020 Paper "Two Causal Principles for Improving Visual Dialog"☆31Feb 19, 2023Updated 3 years ago
- Memory, Attention and Composition (MAC) Network for CLEVR implemented in PyTorch☆85Feb 5, 2019Updated 7 years ago
- "CCNLab: A Benchmarking Framework for Computational Cognitive Neuroscience" (NeurIPS 2021)☆10Jul 12, 2021Updated 4 years ago
- ☆10Dec 16, 2023Updated 2 years ago