ronilp/mac-network-pytorch-gqa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ronilp/mac-network-pytorch-gqa)

ronilp / mac-network-pytorch-gqa

Memory, Attention and Composition (MAC) Network for CLEVR/GQA implemented in PyTorch

☆27

Alternatives and similar repositories for mac-network-pytorch-gqa

Users that are interested in mac-network-pytorch-gqa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rosinality / mac-network-pytorch
View on GitHub
Memory, Attention and Composition (MAC) Network for CLEVR implemented in PyTorch
☆85Feb 5, 2019Updated 7 years ago
wenhuchen / Meta-Module-Network
View on GitHub
Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"
☆43May 13, 2021Updated 5 years ago
ronghanghu / gqa_single_hop_baseline
View on GitHub
A simple but well-performing "single-hop" visual attention model for the GQA dataset
☆20Aug 8, 2019Updated 6 years ago
microsoft / DFOL-VQA
View on GitHub
Differentiable First-Order Logic Reasoning for Visual Question Answering
☆45Mar 7, 2021Updated 5 years ago
ceyzaguirre4 / NSM
View on GitHub
Neural State Machine implemented in PyTorch
☆71Oct 10, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kakao / DAFT
View on GitHub
Code for the NeurIPS 2019 paper: "Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning"
☆33Jun 27, 2023Updated 3 years ago
stanfordnlp / mac-network
View on GitHub
Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)
☆511Jul 10, 2021Updated 5 years ago
bknyaz / sgg
View on GitHub
Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero and few-shot generalization [BMVC 2020, ICCV …
☆143Jun 18, 2023Updated 3 years ago
nlpai-lab / Korean-CommonGen
View on GitHub
[Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation
☆11May 27, 2022Updated 4 years ago
ronghanghu / lcgn
View on GitHub
Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019
☆92Aug 9, 2019Updated 6 years ago
shenxiang-vqa / LSAT
View on GitHub
Local self-attention in Transformer for visual question answering
☆13Mar 17, 2024Updated 2 years ago
gorjanradevski / sr-bert
View on GitHub
Codebase for "Decoding language spatial relations to 2D spatial arrangements" (Findings of EMNLP 2020).
☆11Feb 10, 2023Updated 3 years ago
yanxinzju / CSS-VQA
View on GitHub
Counterfactual Samples Synthesizing for Robust VQA
☆78Nov 24, 2022Updated 3 years ago
escorciav / deep-action-proposals
View on GitHub
Action Proposals generated by deep models
☆29Mar 19, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Clearloveyuan / awesome-Radiology-Report-Generation
View on GitHub
Paper List about Radiology Report Generation and also some medical image captioning
☆11Oct 5, 2021Updated 4 years ago
yuleiniu / cfvqa
View on GitHub
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
☆136Dec 15, 2021Updated 4 years ago
AwalkZY / CPN
View on GitHub
Code for CVPR2021 Paper “Cascaded Prediction Network via Segment Tree for Temporal Video Grounding”
☆10Apr 3, 2022Updated 4 years ago
Peratham / video2text.pytorch
View on GitHub
PyTorch implementation of video captioning
☆13Sep 24, 2017Updated 8 years ago
shtechair / vqa-sva
View on GitHub
Structured Attentions for Visual Question Answering
☆46Mar 4, 2018Updated 8 years ago
mlzxy / LipRead
View on GitHub
A Lip Reading Neural Network using LSTM, implemented upon keras
☆17Mar 16, 2016Updated 10 years ago
yytzsy / grounding_changing_distribution
View on GitHub
☆36Apr 14, 2021Updated 5 years ago
yuleiniu / introd
View on GitHub
[NeurIPS 2021] Introspective Distillation for Robust Question Answering
☆13Dec 7, 2021Updated 4 years ago
cuiyuhao1996 / mcan-vqa
View on GitHub
Deep Modular Co-Attention Networks for Visual Question Answering
☆10Jul 10, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CCYChongyanChen / VQA_AlgorithmDatasets
View on GitHub
☆37Jan 20, 2023Updated 3 years ago
hengyuan-hu / bottom-up-attention-vqa
View on GitHub
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
☆768Mar 10, 2024Updated 2 years ago
wx-zhang / spu
View on GitHub
☆16Jul 1, 2024Updated 2 years ago
UMass-Embodied-AGI / VisualCoT
View on GitHub
Codebase for AAAI 2024 conference paper Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning
☆40Mar 12, 2025Updated last year
jungokasai / THumB
View on GitHub
☆15Apr 8, 2022Updated 4 years ago
Jerenyaoyelu / Python-Programming---COMP9021
View on GitHub
all works in the course
☆15Mar 28, 2019Updated 7 years ago
zhexu1997 / HiSA
View on GitHub
☆10Aug 21, 2022Updated 3 years ago
rene-puschinger / ppm
View on GitHub
Prediction by Partial Matching
☆16Apr 3, 2020Updated 6 years ago
ronghanghu / snmn
View on GitHub
Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018
☆71Nov 17, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ba305 / LightGCN-Spotify
View on GitHub
☆10Dec 30, 2021Updated 4 years ago
mshukor / EvALign-ICL
View on GitHub
[ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …
☆22Mar 1, 2024Updated 2 years ago
CarolineGao / LoRA-Dataset
View on GitHub
[NeurIPS2023] LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering
☆12Jan 5, 2024Updated 2 years ago
chrisdxie / in-depth-VI-tutorial
View on GitHub
☆11Jun 17, 2016Updated 10 years ago
aurooj / MMFT-BERT
View on GitHub
☆14Jun 29, 2024Updated 2 years ago
loscheris / VideoCaptioning_att
View on GitHub
A video captioning tool using S2VT method and attention mechanism (TensorFlow)
☆15Oct 14, 2018Updated 7 years ago
trinhdrew1418 / intermodal-triplet-network
View on GitHub
Triplet neural network for joint representation learning for text and images
☆10Mar 17, 2019Updated 7 years ago