aimagelab / awesome-human-visual-attention
This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, human visual search.
☆49Updated 4 months ago
Alternatives and similar repositories for awesome-human-visual-attention:
Users that are interested in awesome-human-visual-attention are comparing it to the libraries listed below
- ☆10Updated last month
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆33Updated last year
- CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers"☆18Updated 6 months ago
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering"☆21Updated 4 years ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for…☆23Updated last month
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆13Updated 3 weeks ago
- Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. (CVPR 2023)☆60Updated 3 weeks ago
- [NeurIPS2022] Mind Reader: Reconstructing complex images from brain activities☆62Updated 2 years ago
- pytorch implementation of the different DeepGaze models☆132Updated last year
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)☆39Updated 3 months ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images☆19Updated last year
- [WACV 2024] Code release for "VEATIC: Video-based Emotion and Affect Tracking in Context Dataset"☆13Updated 10 months ago
- 🔥 Pytorch implementation for a feedback saliency detection model (SalFBNet)☆19Updated last month
- Scanpath metrics in Python☆31Updated 3 years ago
- Improving neural network representations using human similarity judgments☆14Updated 4 months ago
- Integrating Human Gaze into Attention for Egocentric Activity Recognition (WACV 2021)☆25Updated last year
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning☆130Updated 2 years ago
- An implementation of the paper "End-to-End Human-Gaze-Target Detection with Transformers"☆17Updated 3 months ago
- [WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling☆63Updated last month
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆96Updated 9 months ago
- Official Repository for VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning☆23Updated 11 months ago
- TranSalNet: Towards perceptually relevant visual saliency prediction. Neurocomputing (2022)☆56Updated 8 months ago
- [CVPR'25] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering☆21Updated last week
- ☆62Updated last year
- Target-absent Human Attention (ECCV2022)☆17Updated 2 years ago
- Official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval". CVPR 2022☆102Updated 2 years ago
- ☆56Updated 8 months ago
- [ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion☆35Updated last month
- ☆77Updated last year
- ☆50Updated 10 months ago