karunraju/VQA

Hierarchical Question-Image Co-Attention for Visual Question Answering

☆24

Alternatives and similar repositories for VQA

Users that are interested in VQA are comparing it to the libraries listed below.

Axe-- / Visual-Question-Answering
PyTorch Implementation of VQA Baseline & Hierarchical Co-Attention model
☆16 · Oct 3, 2023 · Updated 2 years ago
SkyOL5 / VQA-CoAttention
☆12 · Aug 29, 2019 · Updated 6 years ago
pyyush / GraphML
PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks"
☆14 · Mar 25, 2023 · Updated 3 years ago
crodriguezo / TMLGA
Repository of proposal-free temporal moment localization work
☆33 · Jun 11, 2024 · Updated 2 years ago
zmzhang2000 / MIGCN
Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos
☆16 · May 23, 2023 · Updated 3 years ago
forwchen / HVTG
Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"
☆17 · Aug 25, 2020 · Updated 5 years ago
JaywongWang / TGN
TensorFlow Reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video"
☆17 · Nov 21, 2022 · Updated 3 years ago
arya46 / VQA-Flask-App
A simple Flask app to generate an answer given an image and a natural language question about the image. The app uses a deep learning model,…
☆12 · Nov 21, 2022 · Updated 3 years ago
dazhang-cv / MAN
This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment"
☆17 · May 27, 2019 · Updated 7 years ago
crodriguezo / DORi
Public repository for DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video Code accompan…
☆21 · Apr 7, 2021 · Updated 5 years ago
xiaoneil / LPNet
☆13 · Nov 28, 2021 · Updated 4 years ago
szc19990412 / LNPL-MIL
☆12 · Sep 25, 2023 · Updated 2 years ago
rsinghlab / OvO
☆12 · Sep 30, 2024 · Updated last year
leafage-autumn / Vulnerability_classify
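NVD and CNNVD software vulnerability datasets, vulnerability text preprocessing, and training of algorithm models for vulnerability classification
☆11 · Oct 13, 2018 · Updated 7 years ago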
zhexu1997 / HiSA
☆10 · Aug 21, 2022 · Updated 3 years ago
yihong-97 / STICT
Code and Dataset for our CVPR 2022 paper "Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training"
☆12 · Jul 8, 2022 · Updated 4 years ago
zongshenmu / attention_knowledge_vqa
VQA driven by bottom-up and top-down attention and knowledge
☆14 · Nov 21, 2018 · Updated 7 years ago
mehak25 / BiGAN
☆13 · Aug 2, 2021 · Updated 4 years ago
erobic / negative_analysis_of_grounding
Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)
☆23 · Jun 26, 2020 · Updated 6 years ago
Janie1996 / MSRFG
The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations
☆11 · Jan 17, 2023 · Updated 3 years ago
madhawav / MML
Multi-faceted Video Moment Localizer
☆17 · Jun 19, 2020 · Updated 6 years ago
Huntersxsx / RaNet
source code of our RaNet in EMNLP 2021
☆30 · May 31, 2022 · Updated 4 years ago
MILVLG / mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
☆459 · Dec 16, 2020 · Updated 5 years ago
teowu / DOVER-Dev
This is a [forked version] for the author's debugging. Please go to https://github.com/QualityAssessment/DOVER for the stable version.
☆14 · Oct 29, 2023 · Updated 2 years ago
tanghaoyu258 / ACRM-for-moment-retrieval
☆27 · Aug 16, 2022 · Updated 3 years ago
Alex-HaochenLi / Soft-InfoNCE
[EMNLP'23] Code for 'Rethinking Negative Pairs in Code Search'
☆16 · Oct 17, 2023 · Updated 2 years ago
fawazsammani / look-and-modify
Look and Modify: Modification Networks for Image Captioning, BMVC 2019
☆21 · Feb 18, 2020 · Updated 6 years ago
shuoyang129 / eamat
Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization (IJCAI-22)
☆12 · Oct 11, 2022 · Updated 3 years ago
BestiVictory / DPC-Captions
An image caption dataset about images from www.dpchallenge.com.
☆20 · Dec 12, 2019 · Updated 6 years ago
AliceOTHMANI / EmoAudioNet
Code for EmoAudioNet, a deep neural network for speech classification (published in ICPR 2020)
☆14 · Jul 13, 2020 · Updated 6 years ago
Yibing-Du / adversarial-factcheck
AAAI-22 paper: Synthetic Disinformation Attacks on Automated Fact Verification Systems
☆12 · Feb 23, 2022 · Updated 4 years ago
DongqiFu / DISCO
DISCO: Comprehensive and Explainable Disinformation Detection, CIKM 2022
☆10 · May 5, 2023 · Updated 3 years ago
zaynmi / seada-vqa
A PyTorch implementation of a data augmentation method for visual question answering
☆21 · May 25, 2023 · Updated 3 years ago
BAI-Yeqi / Statistical-Properties-of-Dot-Product
☆17 · Nov 23, 2021 · Updated 4 years ago
rdemedrano / crann_traffic
A Spatio-Temporal Spot-Forecasting Framework for Urban Traffic Prediction
☆15 · Oct 27, 2020 · Updated 5 years ago
praveena2j / RecurrentJointAttentionwithLSTMs
ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"
☆14 · Nov 29, 2024 · Updated last year
UKPLab / emnlp2022-missing-counter-evidence
Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/…
☆10 · Jun 21, 2023 · Updated 3 years ago
shuaizengMU / PEFT-SP
☆21 · Oct 22, 2024 · Updated last year
RKorzeniowski / BigBiGAN-PyTorch
Unofficial PyTorch implementation of BigBiGAN
☆11 · Mar 26, 2021 · Updated 5 years ago