karunraju / VQA
Hierarchical Question-Image Co-Attention for Visual Question Answering
☆24 · Jun 2, 2019 · Updated 6 years ago
Alternatives and similar repositories for VQA
Users who are interested in VQA compare it to the repositories listed below
- ☆12 · Aug 29, 2019 · Updated 6 years ago
- PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" ☆14 · Mar 25, 2023 · Updated 2 years ago
- Repository of proposal-free temporal moment localization work ☆33 · Jun 11, 2024 · Updated last year
- Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos ☆16 · May 23, 2023 · Updated 2 years ago
- TensorFlow Reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video" ☆17 · Nov 21, 2022 · Updated 3 years ago
- VQA driven by bottom-up and top-down attention and knowledge ☆14 · Nov 21, 2018 · Updated 7 years ago
- PyTorch Implementation of VQA Baseline & Hierarchical Co-Attention model ☆16 · Oct 3, 2023 · Updated 2 years ago
- Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language" ☆17 · Aug 25, 2020 · Updated 5 years ago
- Multi-faceted Video Moment Localizer ☆17 · Jun 19, 2020 · Updated 5 years ago
- Look and Modify: Modification Networks for Image Captioning, BMVC 2019 ☆21 · Feb 18, 2020 · Updated 5 years ago
- Shows visual grounding methods can be right for the wrong reasons! (ACL 2020) ☆23 · Jun 26, 2020 · Updated 5 years ago
- This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment" ☆17 · May 27, 2019 · Updated 6 years ago
- Public repository for DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video Code accompan… ☆21 · Apr 7, 2021 · Updated 4 years ago
- ☆27 · Oct 7, 2021 · Updated 4 years ago
- Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering ☆107 · Oct 14, 2019 · Updated 6 years ago
- Code for the Grounded Visual Question Answering (GVQA) model from the paper -- Don't Just Assume; Look and Answer: Overcoming Priors for … ☆27 · Mar 10, 2022 · Updated 3 years ago
- Annotate the point cloud data from KITTI with the labels ☆28 · Sep 14, 2025 · Updated 5 months ago
- ☆27 · Aug 16, 2022 · Updated 3 years ago
- Read-only mirror of https://git.hloth.dev/hloth/vfs-status-bot ☆10 · Jul 14, 2025 · Updated 7 months ago
- Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering ☆27 · Apr 15, 2021 · Updated 4 years ago
- Source code of our RaNet in EMNLP 2021 ☆30 · May 31, 2022 · Updated 3 years ago
- Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding" ☆29 · May 31, 2023 · Updated 2 years ago
- ☆11 · Mar 14, 2023 · Updated 2 years ago
- ☆13 · Jan 22, 2026 · Updated 3 weeks ago
- ☆16 · Jul 24, 2022 · Updated 3 years ago
- Simple Fast Virtual Machine ☆10 · Apr 3, 2024 · Updated last year
- The source code of the paper: "To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression" ☆30 · Jan 8, 2019 · Updated 7 years ago
- Code implementation of DeepRare ☆32 · Dec 9, 2025 · Updated 2 months ago
- Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization ☆34 · Sep 3, 2020 · Updated 5 years ago
- Deep Modular Co-Attention Networks for Visual Question Answering ☆458 · Dec 16, 2020 · Updated 5 years ago
- Verilog SDR SDRAM controller for FPGA Xilinx and Lattice ☆17 · Jan 3, 2021 · Updated 5 years ago
- This is an official implementation of video classification for our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Atten… ☆12 · Jan 30, 2021 · Updated 5 years ago
- MuJoCo humanoid robot simulation and control system ☆41 · Dec 5, 2025 · Updated 2 months ago
- ☆13 · Nov 28, 2021 · Updated 4 years ago
- Code for "Learning Harmonic Molecular Representations on Riemannian Manifold", ICLR, 2023 ☆10 · Mar 23, 2023 · Updated 2 years ago
- The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models ☆12 · Oct 28, 2024 · Updated last year
- Includes the SVD-based approximation algorithms for compressing deep learning models and the FPGA accelerators exploiting such approximat… ☆16 · Mar 3, 2023 · Updated 2 years ago
- Example Projects for the Microsemi SmartFusion 2 ☆11 · Dec 10, 2017 · Updated 8 years ago
- Code and data for the Nature Machine Intelligence paper "Knowledge graph-enhanced molecular contrastive learning with functional prompt". ☆10 · May 16, 2023 · Updated 2 years ago