aimagelab / awesome-human-visual-attention
This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, human visual search.
☆50Updated 4 months ago
Alternatives and similar repositories for awesome-human-visual-attention:
Users that are interested in awesome-human-visual-attention are comparing it to the libraries listed below
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆33Updated last year
- ☆10Updated 2 months ago
- CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers"☆19Updated 7 months ago
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering"☆21Updated 4 years ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for…☆24Updated 2 months ago
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)☆40Updated 4 months ago
- pytorch implementation of the different DeepGaze models☆137Updated last year
- [NeurIPS2022] Mind Reader: Reconstructing complex images from brain activities☆62Updated 2 years ago
- 🔥 Pytorch implementation for a feedback saliency detection model (SalFBNet)☆19Updated 2 months ago
- [WACV 2024] Code release for "VEATIC: Video-based Emotion and Affect Tracking in Context Dataset"☆13Updated 11 months ago
- Scanpath metrics in Python☆31Updated 3 years ago
- SMG source code and dataset☆16Updated last year
- Improving neural network representations using human similarity judgments☆14Updated 5 months ago
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆13Updated last month
- Constrained Levy Exploration (CLE) generates a scanpath computing eye movements as Levy flight on a saliency map.☆18Updated 2 years ago
- [CVPR 2023] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation☆61Updated last month
- ☆59Updated 9 months ago
- Official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval." CVPR 2022☆105Updated 2 years ago
- [BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization☆17Updated 7 months ago
- Target-absent Human Attention (ECCV2022)☆18Updated 2 years ago
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)☆48Updated last year
- [CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering☆27Updated 3 weeks ago
- An implementation of the paper "End-to-End Human-Gaze-Target Detection with Transformers"☆18Updated 4 months ago
- [CVPR2023] Context De-confounded Emotion Recognition☆18Updated last year
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆22Updated 10 months ago
- ☆78Updated last year
- [CVPR 2023] Official code repository for "How you feelin'? Learning Emotions and Mental States in Movie Scenes". https://arxiv.org/abs/23…☆56Updated 6 months ago
- [ECCV 2024] UMBRAE: Unified Multimodal Brain Decoding | Unveiling the 'Dark Side' of Brain Modality☆46Updated 7 months ago
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".☆12Updated 2 months ago
- ☆16Updated 10 months ago