aimagelab / awesome-human-visual-attention
This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, human visual search.
☆44Updated last month
Alternatives and similar repositories for awesome-human-visual-attention:
Users that are interested in awesome-human-visual-attention are comparing it to the libraries listed below
- ☆10Updated 2 months ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆31Updated 10 months ago
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering"☆21Updated 3 years ago
- [NeurIPS2022] Mind Reader: Reconstructing complex images from brain activities☆61Updated 2 years ago
- CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers"☆16Updated 4 months ago
- pytorch implementation of the different DeepGaze models☆121Updated last year
- Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023☆58Updated 3 months ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for…☆21Updated 5 months ago
- Scanpath metrics in Python☆31Updated 3 years ago
- [WACV2025] SUM: Saliency Unification through Mamba for Visual Attention Modeling☆56Updated last month
- Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization (BMVC 2024 Oral ✨)☆16Updated 4 months ago
- [CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.☆25Updated 8 months ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆32Updated last year
- Training A Small Emotional Vision Language Model for Visual Art Comprehension☆14Updated 6 months ago
- 🔥 Pytorch implementation for a feedback saliency detection model (SalFBNet)☆19Updated 8 months ago
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023)☆38Updated last month
- Code for CVPR 2023 paper "SViTT: Temporal Learning of Sparse Video-Text Transformers"☆18Updated last year
- Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024☆52Updated 5 months ago
- Constrained Levy Exploration (CLE) generates a scanpath computing eye movements as Levy flight on a saliency map.☆18Updated 2 years ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆31Updated 10 months ago
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆12Updated 2 months ago
- Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024☆50Updated 3 months ago
- Improving neural network representations using human similarity judgments☆14Updated 2 months ago
- [NeurIPS2024 D&B Spotlight] GAIA: Rethinking Action Quality Assessment for AI-Generated Videos☆22Updated 4 months ago
- ☆61Updated last year
- Official PyTorch code of "Grounded Question-Answering in Long Egocentric Videos", accepted by CVPR 2024.☆56Updated 4 months ago
- Official Repository for VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning☆21Updated 9 months ago
- ☆51Updated 6 months ago
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning☆127Updated last year
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆93Updated 6 months ago