rentainhe/TRAR-VQA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rentainhe/TRAR-VQA)

rentainhe / TRAR-VQA

[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"

☆68

Alternatives and similar repositories for TRAR-VQA

Users that are interested in TRAR-VQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shenxiang-vqa / LSAT
View on GitHub
Local self-attention in Transformer for visual question answering
☆13Mar 17, 2024Updated 2 years ago
szzexpoi / rex
View on GitHub
Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"
☆22Nov 21, 2023Updated 2 years ago
TIAN-viola / DynRT
View on GitHub
Official implementation of Dynamic Routing Transformer Network for Multimodal Sarcasm Detection (ACL'23)
☆35Jul 9, 2023Updated 3 years ago
val-iisc / RMLVQA
View on GitHub
☆19May 31, 2023Updated 3 years ago
zhangxuying1004 / RSTNet
View on GitHub
Official Code for 'RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words' (CVPR 2021)
☆123Dec 17, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
luo3300612 / Transformer-Captioning
View on GitHub
Optimized code based on M2 for faster image captioning training
☆21Nov 18, 2022Updated 3 years ago
zhangxi1997 / VQACL
View on GitHub
VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)
☆45Mar 28, 2024Updated 2 years ago
yashkant / sam-textvqa
View on GitHub
Official code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
☆65Sep 15, 2021Updated 4 years ago
szzexpoi / POEM
View on GitHub
Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…
☆10Jun 16, 2024Updated 2 years ago
linjieli222 / VQA_ReGAT
View on GitHub
Research Code for ICCV 2019 paper "Relation-aware Graph Attention Network for Visual Question Answering"
☆187Apr 15, 2021Updated 5 years ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
View on GitHub
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18Sep 17, 2021Updated 4 years ago
aioz-ai / CFR_VQA
View on GitHub
Coarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)
☆48Apr 22, 2026Updated 3 months ago
guoyang9 / UnifER
View on GitHub
Official implementation for the MM'22 paper.
☆14Jun 30, 2022Updated 4 years ago
zaynmi / seada-vqa
View on GitHub
A pytorch implemetation of data augmentation method for visual question answering
☆21May 25, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Oneflow-Inc / vision
View on GitHub
Datasets, Transforms and Models specific to Computer Vision
☆91Nov 17, 2023Updated 2 years ago
YouHuang67 / mamba-code-explained
View on GitHub
☆19Jan 7, 2026Updated 6 months ago
ronghanghu / lcgn
View on GitHub
Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019
☆92Aug 9, 2019Updated 6 years ago
LeeYN-43 / Clover
View on GitHub
Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)
☆39Feb 15, 2023Updated 3 years ago
CCIIPLab / DPT
View on GitHub
The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering
☆20May 10, 2022Updated 4 years ago
MayankSingal / VQA-Transformer
View on GitHub
Visual Question Answering through transformers.
☆13Sep 21, 2018Updated 7 years ago
ovguyo / captions-in-VQA
View on GitHub
Using image captions with LLM for zero-shot VQA
☆19Mar 14, 2024Updated 2 years ago
seanzhuh / SeqTR
View on GitHub
SeqTR: A Simple yet Universal Network for Visual Grounding
☆144Oct 30, 2024Updated last year
LandyGuo / Download_HowTo100M
View on GitHub
code for downloading videos from HowTo100M dataset
☆18May 13, 2021Updated 5 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
yangxuntu / catt
View on GitHub
☆12Mar 8, 2021Updated 5 years ago
showlab / CLVQA
View on GitHub
[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)
☆42Mar 23, 2024Updated 2 years ago
yangxuntu / lxmertcatt
View on GitHub
☆79Oct 8, 2022Updated 3 years ago
downdric / MSD
View on GitHub
The official implementation of the paper "DIP: Dual Incongruity Perceiving Network for Sarcasm Detection"
☆36Dec 6, 2024Updated last year
aurooj / SHG-VQA
View on GitHub
Learning Situation Hyper-Graphs for Video Question Answering
☆23Feb 16, 2024Updated 2 years ago
mrwu-mac / DIFNet
View on GitHub
[CVPR 2022] This repository is for the paper ``DIFNet: Boosting Visual Information Flow for Image Captioning'' .
☆21Nov 28, 2022Updated 3 years ago
China-UK-ZSL / ZS-F-VQA
View on GitHub
[Paper][ISWC 2021] Zero-shot Visual Question Answering using Knowledge Graph
☆72Feb 9, 2024Updated 2 years ago
carefree0910 / carefree-flow
View on GitHub
Deep Learning ❤️ OneFlow
☆19Aug 26, 2021Updated 4 years ago
MILVLG / openvqa
View on GitHub
A lightweight, scalable, and general framework for visual question answering research
☆334Sep 3, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
prdwb / okvqa-release
View on GitHub
☆15May 10, 2021Updated 5 years ago
jokieleung / awesome-visual-question-answering
View on GitHub
A curated list of Visual Question Answering(VQA)(Image/Video Question Answering),Visual Question Generation ,Visual Dialog ,Visual Common…
☆672Jul 6, 2023Updated 3 years ago
AndersonStra / Mucko
View on GitHub
implementation for Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
☆10Mar 17, 2022Updated 4 years ago
rentainhe / pytorch-distributed-training
View on GitHub
Simple tutorials on Pytorch DDP training
☆278Aug 19, 2022Updated 3 years ago
jingheya / PixelFolder
View on GitHub
PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation (ECCV 2022)
☆33Jul 21, 2022Updated 4 years ago
THUNLP-MT / ActiView
View on GitHub
☆11Dec 20, 2024Updated last year
CrossmodalGroup / SSL-VQA
View on GitHub
Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
☆52Aug 21, 2020Updated 5 years ago