Hierarchical Question-Image Co-Attention for Visual Question Answering
☆24Jun 2, 2019Updated 7 years ago
Alternatives and similar repositories for VQA
Users that are interested in VQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Implementation of VQA Baseline & Hierarchical Co-Attention model☆16Oct 3, 2023Updated 2 years ago
- ☆12Aug 29, 2019Updated 6 years ago
- PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks"☆14Mar 25, 2023Updated 3 years ago
- Repository of proposal-free temporal moment localization work☆33Jun 11, 2024Updated 2 years ago
- Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos☆16May 23, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"☆17Aug 25, 2020Updated 5 years ago
- Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering☆107Oct 14, 2019Updated 6 years ago
- Tensorflow Reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video"☆17Nov 21, 2022Updated 3 years ago
- A simple Flask app to generate answer given an image and a natural language question about the image. The app uses a deep learning model,…☆12Nov 21, 2022Updated 3 years ago
- This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment"☆17May 27, 2019Updated 7 years ago
- Public repository for DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video Code accompan…☆21Apr 7, 2021Updated 5 years ago
- ☆13Nov 28, 2021Updated 4 years ago
- Read-only mirror of https://git.hloth.dev/hloth/vfs-status-bot☆12Jul 14, 2025Updated 11 months ago
- ☆12Sep 25, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆12Apr 1, 2017Updated 9 years ago
- ☆12Sep 30, 2024Updated last year
- Implementation of Hashtag Recommendation for Photo Sharing Services☆12Nov 23, 2018Updated 7 years ago
- ☆10Aug 21, 2022Updated 3 years ago
- vqa drived by bottom-up and top-down attention and knowledge☆14Nov 21, 2018Updated 7 years ago
- Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)☆23Jun 26, 2020Updated 5 years ago
- Source code for Findings of EMNLP 2021 paper ``Keyphrase Generation with Fine-Grained Evaluation-Guided Reinforcement Learning``☆13Nov 9, 2021Updated 4 years ago
- Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization(IJCAI-22)☆12Oct 11, 2022Updated 3 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Multi-faceted Video Moment Localizer☆17Jun 19, 2020Updated 6 years ago
- source code of our RaNet in EMNLP 2021☆30May 31, 2022Updated 4 years ago
- Deep Modular Co-Attention Networks for Visual Question Answering☆458Dec 16, 2020Updated 5 years ago
- Image Fluency Scores in R☆12Jun 3, 2026Updated 2 weeks ago
- This is a [forked version] for author's debugging. Please jump to https://github.com/QualityAssessment/DOVER for stable version to use.☆14Oct 29, 2023Updated 2 years ago
- ☆27Aug 16, 2022Updated 3 years ago
- Code and data for Aesthetic Image Captioning from Weakly-Labelled Photographs☆34Oct 24, 2019Updated 6 years ago
- Look and Modify: Modification Networks for Image Captioning, BMVC 2019☆21Feb 18, 2020Updated 6 years ago
- Replication package for evaluation of code generation metrics☆17Nov 24, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A pytorch implemetation of data augmentation method for visual question answering☆21May 25, 2023Updated 3 years ago
- A image caption dataset about images from www.dpchallenge.com.☆20Dec 12, 2019Updated 6 years ago
- A Keras implementation of VQA using the easy-VQA dataset.☆22Aug 16, 2020Updated 5 years ago
- Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding"☆29May 31, 2023Updated 3 years ago
- ☆17Nov 23, 2021Updated 4 years ago
- Here the code of EmoAudioNet is a deep neural network for speech classification (published in ICPR 2020)☆14Jul 13, 2020Updated 5 years ago
- AAAI-22 paper: Synthetic Disinformation Attacks on Automated Fact Verification Systems☆12Feb 23, 2022Updated 4 years ago