aimagelab / awesome-human-visual-attentionLinks
This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, human visual search.
☆54Updated last month
Alternatives and similar repositories for awesome-human-visual-attention
Users that are interested in awesome-human-visual-attention are comparing it to the libraries listed below
Sorting:
- ☆11Updated 4 months ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆36Updated last year
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering"☆21Updated 4 years ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images☆20Updated 2 years ago
- CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers"☆20Updated this week
- pytorch implementation of the different DeepGaze models☆148Updated 2 years ago
- Scanpath metrics in Python☆30Updated 4 years ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for…☆27Updated 4 months ago
- [BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization☆18Updated 9 months ago
- Target-absent Human Attention (ECCV2022)☆18Updated 2 years ago
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆22Updated last year
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".☆12Updated 4 months ago
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)☆41Updated 6 months ago
- [NeurIPS2022] Mind Reader: Reconstructing complex images from brain activities☆62Updated 2 years ago
- Integrating Human Gaze into Attention for Egocentric Activity Recognition (WACV 2021)☆25Updated last year
- [NeurIPS2024 D&B Spotlight] GAIA: Rethinking Action Quality Assessment for AI-Generated Videos☆28Updated 2 months ago
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆14Updated 3 months ago
- Constrained Levy Exploration (CLE) generates a scanpath computing eye movements as Levy flight on a saliency map.☆18Updated 2 years ago
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation☆33Updated 6 months ago
- The pytorch implementation of STSANet (non-official)☆11Updated 2 years ago
- Improving neural network representations using human similarity judgments☆13Updated 7 months ago
- [CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering☆35Updated 2 months ago
- [CVPR 2023] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆62Updated 3 months ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆99Updated 11 months ago
- 🔥 Pytorch implementation for a feedback saliency detection model (SalFBNet)☆19Updated 4 months ago
- [WACV 2024] Code release for "VEATIC: Video-based Emotion and Affect Tracking in Context Dataset"☆15Updated last year
- [WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling☆70Updated 2 months ago
- ☆36Updated 4 months ago
- ViNet Pushing the limits of Visual Modality for Audio Visual Saliency Prediction☆68Updated 2 years ago
- ☆80Updated 2 years ago