Hierarchical Question-Image Co-Attention for Visual Question Answering
☆24Jun 2, 2019Updated 6 years ago
Alternatives and similar repositories for VQA
Users that are interested in VQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Implementation of VQA Baseline & Hierarchical Co-Attention model☆16Oct 3, 2023Updated 2 years ago
- ☆12Aug 29, 2019Updated 6 years ago
- Repository of proposal-free temporal moment localization work☆33Jun 11, 2024Updated last year
- Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos☆16May 23, 2023Updated 3 years ago
- Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"☆17Aug 25, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering☆107Oct 14, 2019Updated 6 years ago
- Tensorflow Reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video"☆17Nov 21, 2022Updated 3 years ago
- This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment"☆17May 27, 2019Updated 7 years ago
- Public repository for DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video Code accompan…☆21Apr 7, 2021Updated 5 years ago
- ☆13Nov 28, 2021Updated 4 years ago
- ☆12Aug 7, 2019Updated 6 years ago
- ☆10Aug 21, 2022Updated 3 years ago
- vqa drived by bottom-up and top-down attention and knowledge☆14Nov 21, 2018Updated 7 years ago
- Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)☆23Jun 26, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆13Aug 2, 2021Updated 4 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations☆11Jan 17, 2023Updated 3 years ago
- Multi-faceted Video Moment Localizer☆17Jun 19, 2020Updated 5 years ago
- source code of our RaNet in EMNLP 2021☆30May 31, 2022Updated 3 years ago
- Deep Modular Co-Attention Networks for Visual Question Answering☆459Dec 16, 2020Updated 5 years ago
- This is a [forked version] for author's debugging. Please jump to https://github.com/QualityAssessment/DOVER for stable version to use.☆14Oct 29, 2023Updated 2 years ago
- ☆27Aug 16, 2022Updated 3 years ago
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge☆11Aug 10, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Look and Modify: Modification Networks for Image Captioning, BMVC 2019☆21Feb 18, 2020Updated 6 years ago
- A pytorch implemetation of data augmentation method for visual question answering☆21May 25, 2023Updated 3 years ago
- A image caption dataset about images from www.dpchallenge.com.☆20Dec 12, 2019Updated 6 years ago
- Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding"☆29May 31, 2023Updated 2 years ago
- ☆17Nov 23, 2021Updated 4 years ago
- Here the code of EmoAudioNet is a deep neural network for speech classification (published in ICPR 2020)☆14Jul 13, 2020Updated 5 years ago
- ☆27Oct 7, 2021Updated 4 years ago
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/…☆10Jun 21, 2023Updated 2 years ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…☆13Sep 8, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Tutorial for text classification with BERT, using HuggingFace's transformers.☆13Jan 15, 2020Updated 6 years ago
- ☆351Oct 2, 2018Updated 7 years ago
- ☆14Aug 6, 2021Updated 4 years ago
- ☆36Jan 23, 2024Updated 2 years ago
- Implementation of Mutan+ArticleNet on OKVQA☆10Jan 11, 2021Updated 5 years ago
- NLQF is a tool to filter query-appropriate comments for building high-quality code search datasets.☆19Feb 15, 2022Updated 4 years ago
- implement gat with batch☆10Nov 28, 2020Updated 5 years ago