PyTorch Implementation of VQA Baseline & Hierarchical Co-Attention model
☆16Oct 3, 2023Updated 2 years ago
Alternatives and similar repositories for Visual-Question-Answering
Users that are interested in Visual-Question-Answering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Research Code for NeurIPS 2020 Spotlight paper "Large-Scale Adversarial Training for Vision-and-Language Representation Learning": LXMERT…☆21Oct 20, 2020Updated 5 years ago
- Implementation of Mutan+ArticleNet on OKVQA☆10Jan 11, 2021Updated 5 years ago
- The Easy Visual Question Answering dataset.☆34Oct 3, 2023Updated 2 years ago
- Re-implementation for 'R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering'.☆12Mar 13, 2026Updated last month
- Hierarchical Question-Image Co-Attention for Visual Question Answering☆24Jun 2, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A simple Flask app to generate answer given an image and a natural language question about the image. The app uses a deep learning model,…☆12Nov 21, 2022Updated 3 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆33Jul 1, 2019Updated 6 years ago
- CNN+LSTM, Attention based, and MUTAN-based models for Visual Question Answering☆78Jan 19, 2020Updated 6 years ago
- Dockerfile for deep learning on GPUs☆10Aug 10, 2018Updated 7 years ago
- An environment for mobile angets to interact with realistic android device or android emulator☆13Jul 19, 2024Updated last year
- ☆10May 16, 2021Updated 4 years ago
- Supercharge your Gaianet node by generating a vector knowledge base from any API. Demo slides: https://hackmd.io/@santteegt/ByoykY4nC#/ L…☆11Nov 29, 2024Updated last year
- Triplet neural network for joint representation learning for text and images☆10Mar 17, 2019Updated 7 years ago
- A reading list of papers about Visual Question Answering.☆35Aug 17, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- PatchBackdoor is a code base associated with paper PatchBackdoor.☆12Aug 27, 2024Updated last year
- ☆12Dec 8, 2022Updated 3 years ago
- [ICMR'21, Best Poster Paper Award] Medical Visual Question Answering with Multi-task Pre-training and Cross-modal Self-attention☆35Dec 15, 2022Updated 3 years ago
- An awesome list of machine learning relative system design blog posts from cool eng blogs☆14Jun 2, 2020Updated 5 years ago
- MMBERT: Multimodal BERT Pretraining for Improved Medical VQA☆39Mar 22, 2021Updated 5 years ago
- Multiple Meta-model Quantifying for Medical Visual Question Answering (MICCAI 2021)☆37Oct 12, 2022Updated 3 years ago
- ☆10Jul 23, 2021Updated 4 years ago
- ☆14Mar 27, 2023Updated 3 years ago
- BERT系列模型、搜搜、剪枝、蒸馏☆13Sep 10, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Visual Question Answering model implemented in MindSpore and PyTorch. The model is a reimplementation of the paper *Show, Ask, Attend, …☆10Jul 27, 2021Updated 4 years ago
- This notebook presents a pipeline to process raw data files of battery cycling and the prediction of their useful life before the degrada…☆13Apr 20, 2021Updated 4 years ago
- ☆15May 10, 2021Updated 4 years ago
- [EMNLP 2022] The baseline code for META-GUI dataset☆14Jul 9, 2024Updated last year
- ☆44Jun 16, 2025Updated 9 months ago
- Visual Question Generation☆11Aug 20, 2024Updated last year
- ☆351Oct 2, 2018Updated 7 years ago
- Adversarial perturbations on word embeddings of BERT☆13Jan 17, 2021Updated 5 years ago
- BLOCK (AAAI 2019), with a multimodal fusion library for deep learning models☆355Dec 4, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 🧰 数据科学科研工具箱☆13Mar 22, 2025Updated last year
- pre-trained vision and language model summary☆12Apr 20, 2021Updated 4 years ago
- PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks"☆14Mar 25, 2023Updated 3 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- Langchain Agent finetuning using 7B - LLAMA 2 , on hotpotQA (Retroformer framework)☆16Sep 5, 2023Updated 2 years ago
- CloudCV Visual Question Answering Demo☆67Nov 4, 2022Updated 3 years ago
- GCNs Analysis: Visualization, Error Cases etc.☆14Feb 15, 2023Updated 3 years ago