szzexpoi / AiR
Official Repository for ECCV 2020 paper "AiR: Attention with Reasoning Capability"
☆50Jun 29, 2021Updated 4 years ago
Alternatives and similar repositories for AiR
Users who are interested in AiR are comparing it to the repositories listed below:
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering"☆23Mar 24, 2021Updated 4 years ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆43Mar 15, 2024Updated last year
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"☆22Nov 21, 2023Updated 2 years ago
- ☆22Oct 9, 2021Updated 4 years ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…☆10Jun 16, 2024Updated last year
- [NeurIPS 2021] Introspective Distillation for Robust Question Answering☆13Dec 7, 2021Updated 4 years ago
- Code released for our CHI2023 paper "UEyes: Understanding Visual Saliency across User Interface Types"☆33Jul 16, 2024Updated last year
- Implementation of Graph Based Visual Saliency algorithm by J. Harel, C. Koch, and P. Perona☆58Dec 23, 2019Updated 6 years ago
- ☆13Aug 14, 2022Updated 3 years ago
- The implementation of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling☆12Aug 19, 2021Updated 4 years ago
- ☆18Dec 8, 2022Updated 3 years ago
- Attention-Driven Loss for Anomaly Detection in Video Surveillance☆16May 25, 2021Updated 4 years ago
- Code for ECML/PKDD paper: "LSMI-Sinkhorn: Semi-supervised Mutual Information Estimation with Optimal Transport"☆17Jun 27, 2021Updated 4 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering☆65Sep 4, 2021Updated 4 years ago
- Official implementation for the MM'22 paper.☆14Jun 30, 2022Updated 3 years ago
- Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering☆18Oct 30, 2019Updated 6 years ago
- ☆21May 4, 2022Updated 3 years ago
- A PyTorch implementation of the Probabilistic U-Net, applied to probabilistic glioma growth☆43Jul 10, 2019Updated 6 years ago
- Toolbox for processing, visualising, comparing and generating data related to gaze in 360 contexts (VR notably)☆27Jun 12, 2025Updated 8 months ago
- Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021☆19Jul 27, 2021Updated 4 years ago
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆56Jun 13, 2023Updated 2 years ago
- ☆22Oct 21, 2024Updated last year
- ☆23Aug 21, 2021Updated 4 years ago
- dataset cleansing for Visual Genome☆30Apr 26, 2017Updated 8 years ago
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆29Jan 15, 2022Updated 4 years ago
- Assessing Reliability and Challenges of Uncertainty Estimations for Medical Image Segmentation☆56Oct 3, 2023Updated 2 years ago
- Rank-aware Attention Network from 'The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos'☆30Apr 16, 2021Updated 4 years ago
- Codebase for AAAI 2024 conference paper Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning☆38Mar 12, 2025Updated 11 months ago
- My daily arxiv reading note☆30Nov 10, 2021Updated 4 years ago
- [CVPR 2023 Highlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos☆33Aug 30, 2023Updated 2 years ago
- ☆30Dec 16, 2022Updated 3 years ago
- Pytorch implementation of Capsule Network with Dynamic Routing☆28May 19, 2020Updated 5 years ago
- Jupyter Notebooks demonstrating how to process Pupil Player exports☆35May 2, 2022Updated 3 years ago
- Code for Knowledge-Embedded Routing Network for Scene Graph Generation (CVPR 2019)☆123Aug 17, 2022Updated 3 years ago
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"☆31Nov 24, 2021Updated 4 years ago
- Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering☆31Apr 30, 2024Updated last year
- An R package for analyzing linguistic alignment between partners in conversation transcripts☆13Jan 30, 2026Updated 2 weeks ago
- Matlab code for our CVPR 2014 work "Scene-Independent Group Profiling in Crowd".☆34Mar 13, 2016Updated 9 years ago