aimagelab / awesome-human-visual-attention
This repository contains a curated list of research papers and resources on saliency and scanpath prediction, human attention, and human visual search.
☆51 Updated last week
Alternatives and similar repositories for awesome-human-visual-attention
Users who are interested in awesome-human-visual-attention compare it to the repositories listed below.
- ☆10 Updated 2 months ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023) ☆36 Updated last year
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering" ☆21 Updated 4 years ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for… ☆24 Updated 2 months ago
- CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers" ☆19 Updated 8 months ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images ☆19 Updated last year
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023) ☆14 Updated 2 months ago
- Scanpath metrics in Python ☆30 Updated 3 years ago
- Target-absent Human Attention (ECCV2022) ☆18 Updated 2 years ago
- [BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization ☆18 Updated 8 months ago
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023) ☆40 Updated 5 months ago
- pytorch implementation of the different DeepGaze models ☆138 Updated last year
- 🔥 Pytorch implementation for a feedback saliency detection model (SalFBNet) ☆18 Updated 3 months ago
- [CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering ☆28 Updated last month
- [WACV 2024] Code release for "VEATIC: Video-based Emotion and Affect Tracking in Context Dataset" ☆13 Updated last year
- [NeurIPS2022] Mind Reader: Reconstructing complex images from brain activities ☆62 Updated 2 years ago
- Constrained Levy Exploration (CLE) generates a scanpath computing eye movements as Levy flight on a saliency map. ☆18 Updated 2 years ago
- Integrating Human Gaze into Attention for Egocentric Activity Recognition (WACV 2021) ☆25 Updated last year
- Improving neural network representations using human similarity judgments ☆13 Updated 5 months ago
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation". ☆12 Updated 2 months ago
- [CVPR 2023] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation ☆61 Updated 2 months ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023] ☆97 Updated 10 months ago
- ☆61 Updated 9 months ago
- LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft ☆44 Updated 10 months ago
- Official Repository for VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning ☆24 Updated last year
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation ☆31 Updated 5 months ago
- SMG source code and dataset ☆17 Updated 2 years ago
- An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis (CVPR'21) ☆44 Updated 2 years ago
- An implementation of the paper "End-to-End Human-Gaze-Target Detection with Transformers" ☆18 Updated 5 months ago
- This is the official implementation of 2023 ICCV paper "EmoSet: A large-scale visual emotion dataset with rich attributes". ☆50 Updated last year