Pytorch implementation of VQA: Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf) using VQA v2.0 dataset for open-ended task
☆22Jul 30, 2020Updated 5 years ago
Alternatives and similar repositories for VQA_CNN-LSTM
Users that are interested in VQA_CNN-LSTM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Mar 18, 2024Updated 2 years ago
- A self-evident application of the VQA task is to design systems that aid blind people with sight reliant queries. The VizWiz VQA dataset …☆15Dec 12, 2023Updated 2 years ago
- Exploring multimodal fusion-type transformer models for visual question answering (on DAQUAR dataset)☆37Jan 20, 2022Updated 4 years ago
- Medical Visual Question Answering via Conditional Reasoning [ACM MM 2020]☆64Aug 20, 2021Updated 4 years ago
- Build and visualize Word2Vec model on Amazon health and personal care reviews corpus☆24Sep 10, 2017Updated 8 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆11Oct 22, 2019Updated 6 years ago
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21May 8, 2023Updated 2 years ago
- ☆391Mar 11, 2021Updated 5 years ago
- [NeurIPS2023] LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering☆13Jan 5, 2024Updated 2 years ago
- Strong baseline for visual question answering☆241Mar 13, 2023Updated 3 years ago
- 🧐 DeepFake Detection with PyTorch☆18Aug 7, 2023Updated 2 years ago
- ☆351Oct 2, 2018Updated 7 years ago
- Modular and Simple approach to VQA in Keras☆21Sep 6, 2017Updated 8 years ago
- [IEEE TMI'22] VQAMix: Conditional Triplet Mixup for Medical Visual Question Answering☆16Oct 9, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICML2023] Revisiting Data-Free Knowledge Distillation with Poisoned Teachers☆23Jul 7, 2024Updated last year
- Visual Question Answering Demo and Algorithmia API☆25Feb 17, 2019Updated 7 years ago
- Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”☆49Nov 10, 2022Updated 3 years ago
- CENG501 Project Repositories☆17Aug 22, 2021Updated 4 years ago
- Tutorial on parallel processing of raster data in the {stars} package☆11Sep 22, 2023Updated 2 years ago
- Studi Kasus PHP MySQL : Aplikasi Todolist☆17Feb 22, 2021Updated 5 years ago
- ☆10Jun 13, 2023Updated 2 years ago
- A list of recent papers regarding visual(image) question answering「mainly from arxiv.com」☆16Mar 6, 2019Updated 7 years ago
- Abdominal Organ Segmentation using Multi Decoder Network (MDNet) [Accepted at ICASSP 2025]☆13Apr 15, 2025Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A Light weight deep learning model with with a web application to answer image-based questions with a non-generative approach for the Viz…☆14Jun 27, 2023Updated 2 years ago
- ROS2 wrapper for depth anything☆16Mar 11, 2024Updated 2 years ago
- An implementation of the temporal cluster matching method for detecting change in structure footprints from time series of remotely sense…☆21Nov 3, 2021Updated 4 years ago
- ☆11Jun 21, 2025Updated 9 months ago
- Tools to use with Brian 2, in particular for visualization☆21Mar 31, 2026Updated 2 weeks ago
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering☆65Sep 4, 2021Updated 4 years ago
- parallel image processing algorithms using pymp☆10May 11, 2017Updated 8 years ago
- ☆17Dec 24, 2023Updated 2 years ago
- pix2pix model for generating terrain☆17Jan 7, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Aug 16, 2022Updated 3 years ago
- Visual Question Answering task written in Keras that answers questions about images☆156May 10, 2019Updated 6 years ago
- This a clean and easy-to-use implementation of YOLOv8 in PyTorch, made with ❤️ by Theos AI.☆10Nov 13, 2023Updated 2 years ago
- Urbanization detection using computer vision algorithms to reduce the reliance on the data of government surveys, which will speed up the…☆17Dec 18, 2019Updated 6 years ago
- ☆19Dec 19, 2025Updated 3 months ago
- ☆12Sep 19, 2021Updated 4 years ago
- PyTorch implementation of DoubleUNet for medical image segmentation☆14Apr 6, 2026Updated last week