Memory, Attention and Composition (MAC) Network for CLEVR/GQA implemented in PyTorch
☆27Aug 26, 2024Updated last year
Alternatives and similar repositories for mac-network-pytorch-gqa
Users that are interested in mac-network-pytorch-gqa are comparing it to the libraries listed below.
- Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"☆43May 13, 2021Updated 4 years ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Aug 8, 2019Updated 6 years ago
- Neural State Machine implemented in PyTorch☆71Oct 10, 2019Updated 6 years ago
- Differentiable First-Order Logic Reasoning for Visual Question Answering☆44Mar 7, 2021Updated 5 years ago
- PyTorch implementation of "Explainable and Explicit Visual Reasoning over Scene Graphs"☆93Mar 17, 2019Updated 7 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 3 years ago
- Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019☆92Aug 9, 2019Updated 6 years ago
- Codebase for "Decoding language spatial relations to 2D spatial arrangements" (Findings of EMNLP 2020).☆11Feb 10, 2023Updated 3 years ago
- Local self-attention in Transformer for visual question answering☆13Mar 17, 2024Updated 2 years ago
- Counterfactual Samples Synthesizing for Robust VQA☆79Nov 24, 2022Updated 3 years ago
- Action Proposals generated by deep models☆29Mar 19, 2017Updated 9 years ago
- Paper List about Radiology Report Generation and also some medical image captioning☆11Oct 5, 2021Updated 4 years ago
- Evaluation codes of "From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models".☆16May 15, 2023Updated 2 years ago
- [CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias☆131Dec 15, 2021Updated 4 years ago
- PyTorch implementation of video captioning☆13Sep 24, 2017Updated 8 years ago
- ☆36Apr 14, 2021Updated 4 years ago
- A Lip Reading Neural Network using LSTM, implemented in Keras☆17Mar 16, 2016Updated 10 years ago
- ☆15Jul 1, 2024Updated last year
- Deep Modular Co-Attention Networks for Visual Question Answering☆10Jul 10, 2019Updated 6 years ago
- Implementing CNN in PyTorch with Custom Dataset and Transfer Learning☆11Aug 24, 2020Updated 5 years ago
- ☆10Aug 21, 2022Updated 3 years ago
- ☆15Apr 8, 2022Updated 3 years ago
- KommonGen dataset for commonsense reasoning in Korean generative models.☆17Oct 5, 2021Updated 4 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated 2 years ago
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs☆14Apr 19, 2025Updated 11 months ago
- [ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"☆69Oct 11, 2021Updated 4 years ago
- ☆38Jan 20, 2023Updated 3 years ago
- A companion repository to the "You Only Write Thrice: Creating Documents, Computational Notebooks and Presentations From a Single Source"…☆20Oct 14, 2022Updated 3 years ago
- Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018☆71Nov 17, 2019Updated 6 years ago
- ☆14Jun 29, 2024Updated last year
- [NeurIPS2023] LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering☆13Jan 5, 2024Updated 2 years ago
- Korean Commonsense Knowledge Graph☆15Dec 23, 2022Updated 3 years ago
- A video captioning tool using S2VT method and attention mechanism (TensorFlow)☆15Oct 14, 2018Updated 7 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- ☆19Apr 30, 2024Updated last year
- Github code for the paper Maximum Class Separation as Inductive Bias in One Matrix. Arxiv link: https://arxiv.org/abs/2206.08704☆30Apr 21, 2023Updated 2 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆34Jul 29, 2019Updated 6 years ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆23Nov 1, 2025Updated 4 months ago
- Transformation Driven Visual Reasoning - CVPR 2021☆36May 27, 2023Updated 2 years ago