aimagelab / awesome-human-visual-attention
This repository contains a curated list of research papers and resources on saliency and scanpath prediction, human attention, and human visual search.
☆45 · Updated 3 months ago
Alternatives and similar repositories for awesome-human-visual-attention:
Users who are interested in awesome-human-visual-attention are comparing it to the repositories listed below:
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023) ☆32 · Updated 11 months ago
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering" ☆21 · Updated 3 years ago
- CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers" ☆16 · Updated 5 months ago
- [NeurIPS2022] Mind Reader: Reconstructing complex images from brain activities ☆62 · Updated 2 years ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for… ☆22 · Updated last week
- [WACV 2024] Code release for "VEATIC: Video-based Emotion and Affect Tracking in Context Dataset" ☆13 · Updated 9 months ago
- Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023 ☆60 · Updated 4 months ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images ☆19 · Updated last year
- Target-absent Human Attention (ECCV2022) ☆17 · Updated 2 years ago
- Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization (BMVC 2024 Oral ✨) ☆16 · Updated 5 months ago
- Scanpath metrics in Python ☆31 · Updated 3 years ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023] ☆95 · Updated 8 months ago
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023) ☆13 · Updated last week
- pytorch implementation of the different DeepGaze models ☆126 · Updated last year
- [WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling ☆59 · Updated this week
- Code implementation for our ECCV, 2022 paper titled "My View is the Best View: Procedure Learning from Egocentric Videos" ☆28 · Updated last year
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023) ☆38 · Updated 2 months ago
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning ☆128 · Updated last year
- Official Repository for VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning ☆22 · Updated 10 months ago
- Improving neural network representations using human similarity judgments ☆14 · Updated 3 months ago
- SMG source code and dataset ☆16 · Updated last year
- Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024 ☆50 · Updated 5 months ago
- An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis (CVPR'21) ☆41 · Updated 2 years ago
- Official implementation of "Everything at Once - Multi-modal Fusion Transformer for Video Retrieval". CVPR 2022 ☆99 · Updated 2 years ago
- With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ICCV 2023 ☆16 · Updated 8 months ago
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral) ☆48 · Updated last year