lixiangpengcs/PSAC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lixiangpengcs/PSAC)

lixiangpengcs / PSAC

Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering

☆27

Alternatives and similar repositories for PSAC

Users that are interested in PSAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fanchenyou / HME-VideoQA
View on GitHub
Heterogeneous Memory Enhanced Multimodal Attention Model for VideoQA
☆55Sep 13, 2021Updated 4 years ago
salesforce / BiST
View on GitHub
Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)
☆11Jun 16, 2025Updated last year
thaolmk54 / hcrn-videoqa
View on GitHub
Implementation for the paper "Hierarchical Conditional Relation Networks for Video Question Answering" (Le et al., CVPR 2020, Oral)
☆135Jul 25, 2024Updated last year
madeleinegrunde / AGQA_baselines_code
View on GitHub
☆18Nov 1, 2023Updated 2 years ago
doc-doc / NExT-OE
View on GitHub
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
☆30Jul 18, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jayleicn / TVQAplus
View on GitHub
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
☆132Oct 25, 2022Updated 3 years ago
noagarcia / knowit-rock
View on GitHub
ROCK model for Knowledge-Based VQA in Videos
☆31Oct 19, 2020Updated 5 years ago
doc-doc / HQGA
View on GitHub
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)
☆35Sep 17, 2022Updated 3 years ago
erobic / negative_analysis_of_grounding
View on GitHub
Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)
☆23Jun 26, 2020Updated 6 years ago
LuoweiZhou / pytorch-pretrained-BERT
View on GitHub
📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer…
☆11May 30, 2019Updated 7 years ago
SpencerWhitehead / novelvqa
View on GitHub
☆27Oct 7, 2021Updated 4 years ago
jd730 / STRG
View on GitHub
Pytorch Implementation of Videos as Space-Time Region Graphs
☆27Jul 17, 2026Updated last week
noagarcia / ROLL-VideoQA
View on GitHub
PyTorch code for ROLL, a knowledge-based video story question answering model.
☆21Sep 29, 2020Updated 5 years ago
Kyung-Min / Deep-Embedded-Memory-Networks
View on GitHub
https://arxiv.org/abs/1707.00836
☆21Nov 6, 2017Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
affect2mm / emotion-timeseries
View on GitHub
☆16Nov 24, 2020Updated 5 years ago
jayleicn / VideoLanguageFuturePred
View on GitHub
[EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction
☆52Aug 20, 2022Updated 3 years ago
ruixuejianfei / SCAN
View on GitHub
Code for Self-and-Collaborative Attention Network from "SCAN: Self-and-Collaborative Attention Network for Video Person Re-identification…
☆26Jun 1, 2019Updated 7 years ago
Sy-Zhang / TCMN-Release
View on GitHub
Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"
☆16Oct 22, 2022Updated 3 years ago
jialinwu17 / self_critical_vqa
View on GitHub
Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''
☆40Sep 9, 2019Updated 6 years ago
yashkant / concat-vqa
View on GitHub
Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021
☆19Jul 27, 2021Updated 4 years ago
jhyuklee / dmn-pytorch
View on GitHub
Re-implementation: Ask Me Anything: Dynamic Memory Networks for Natural Language Processing
☆14Apr 7, 2019Updated 7 years ago
Annusha / LIReC
View on GitHub
Learning Interactions and Relationships between Movie Characters (CVPR'20)
☆22Apr 12, 2023Updated 3 years ago
Deanplayerljx / tab-vcr
View on GitHub
Pytorch implementation for our NeurIPS 2019 paper "TAB-VCR: Tags and Attributes based VCR Baselines" https://arxiv.org/abs/1910.14671
☆19May 6, 2021Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
salmon1802 / SimCEN
View on GitHub
[MM 2024] SimCEN: Simple Contrast-enhanced Network for CTR Prediction
☆13Apr 10, 2025Updated last year
agethen / RPAN
View on GitHub
Our implementation of Recurrent Pose Attention in Du et al.: "RPAN: An End-to-End Recurrent Pose-attention Network for Action Recognition…
☆37Nov 24, 2018Updated 7 years ago
YunseokJANG / tgif-qa
View on GitHub
Repository for our CVPR 2017 and IJCV: TGIF-QA
☆180Sep 6, 2021Updated 4 years ago
yj-yu / lsmdc
View on GitHub
☆33Nov 12, 2018Updated 7 years ago
WellyZhang / ACRE
View on GitHub
ACRE: Abstract Causal REasoning Beyond Covariation
☆19Dec 7, 2021Updated 4 years ago
chihyaoma / cyclical-visual-captioning
View on GitHub
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46Jul 29, 2020Updated 5 years ago
jimmy646 / violin
View on GitHub
Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"
☆161Apr 29, 2020Updated 6 years ago
InterDigitalInc / DialogSummary-VideoQA
View on GitHub
☆10Mar 30, 2022Updated 4 years ago
KaihuaTang / VQA2.0-Recent-Approachs-2018.pytorch
View on GitHub
A pytroch reimplementation of "Bilinear Attention Network", "Intra- and Inter-modality Attention", "Learning Conditioned Graph Structures…
☆300Jan 6, 2026Updated 6 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
idansc / fga
View on GitHub
☆30Oct 20, 2021Updated 4 years ago
cvlab-tohoku / Dense-CoAttention-Network
View on GitHub
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering
☆107Oct 14, 2019Updated 6 years ago
ych133 / How2R-and-How2QA
View on GitHub
A video retrieval dataset How2R and a video QA dataset How2QA
☆24Oct 15, 2020Updated 5 years ago
Cadene / murel.bootstrap.pytorch
View on GitHub
MUREL (CVPR 2019), a multimodal relational reasoning module for VQA
☆194Feb 9, 2020Updated 6 years ago
rasoolfa / videocap
View on GitHub
Memory-augmented Attention Modelling for Videos
☆10Apr 24, 2017Updated 9 years ago
zongshenmu / attention_knowledge_vqa
View on GitHub
vqa drived by bottom-up and top-down attention and knowledge
☆14Nov 21, 2018Updated 7 years ago
zjiayao / cvpr17
View on GitHub
CVPR '17 Paper Collection
☆10Jul 17, 2017Updated 9 years ago