Official Repository for ECCV 2020 paper "AiR: Attention with Reasoning Capability"
☆53Jun 29, 2021Updated 5 years ago
Alternatives and similar repositories for AiR
Users that are interested in AiR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering"☆27Mar 24, 2021Updated 5 years ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆44Mar 15, 2024Updated 2 years ago
- ☆22Oct 9, 2021Updated 4 years ago
- ☆13Aug 14, 2022Updated 3 years ago
- Beyond Universal Saliency: Personalized Saliency Prediction with Multi-task CNN (IJCAI 2017 and TPAMI)☆11Jan 17, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)☆33May 12, 2022Updated 4 years ago
- Code released for our CHI2023 paper "UEyes: Understanding Visual Saliency across User Interface Types"☆36Jul 16, 2024Updated last year
- Implementation of Graph Based Visual Saliency algorithm by J. Harel, C. Koch, and P. Perona☆61Dec 23, 2019Updated 6 years ago
- The implement of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling☆12Aug 19, 2021Updated 4 years ago
- Connective Cognition Network for Directional Visual Commonsense Reasoning☆15May 6, 2021Updated 5 years ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…☆10Jun 16, 2024Updated 2 years ago
- Toolbox for processing, visualising, comparing and generating data related to gaze in 360 contexts (VR notably)☆28Jun 12, 2025Updated last year
- Localized Narratives☆86Sep 9, 2021Updated 4 years ago
- ☆24May 4, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆29Jan 15, 2022Updated 4 years ago
- Tracking with Human-Intent Reasoning☆77Nov 4, 2024Updated last year
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"☆31Nov 24, 2021Updated 4 years ago
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆66May 9, 2025Updated last year
- ☆23Aug 21, 2021Updated 4 years ago
- ☆14Jan 30, 2017Updated 9 years ago
- Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering☆31Apr 30, 2024Updated 2 years ago
- Learning interaction hotspots from egocentric video☆52Dec 12, 2022Updated 3 years ago
- A PyTorch implementation of the Probabilistic U-Net, applied to probabilistic glioma growth☆43Jul 10, 2019Updated 6 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Official implementation for the MM'22 paper.☆14Jun 30, 2022Updated 4 years ago
- Code for Learning to Learn Language from Narrated Video☆33Oct 3, 2023Updated 2 years ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆21Jun 2, 2025Updated last year
- [Neurocomputing] WAIT: Feature Warping for Animation to Illustration video Translation using GANs☆10Apr 4, 2025Updated last year
- ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration☆57Jun 13, 2023Updated 3 years ago
- ☆19May 25, 2024Updated 2 years ago
- dataset cleansing for Visual Genome☆30Apr 26, 2017Updated 9 years ago
- ☆10May 23, 2023Updated 3 years ago
- AdaCrowd: Unlabeled Scene Adaptation for Crowd Counting (TMM 2021)☆10Feb 24, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Jun 1, 2018Updated 8 years ago
- Official code for the paper "Contrast and Classify: Training Robust VQA Models" published at ICCV, 2021☆19Jul 27, 2021Updated 4 years ago
- An R package for analyzing scanpath patterns in eye movements☆42Jun 2, 2026Updated 3 weeks ago
- Sample application using frame3Sharp CAD-tool framework☆13Oct 11, 2018Updated 7 years ago
- [ECCV2024] Nonverbal Interaction Detection☆31Oct 30, 2024Updated last year
- Data preprocessing for IUPUI-CSRC Pedestrian Situated Intent (PSI) benchmark dataset.☆11Oct 5, 2023Updated 2 years ago
- Attention-Driven Loss for Anomaly Detection in Video Surveillance☆15May 25, 2021Updated 5 years ago