cvlab-stonybrook / HAT
CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers"
☆21 Updated 6 months ago
Alternatives and similar repositories for HAT
Users who are interested in HAT are comparing it to the repositories listed below.
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering" ☆22 Updated 4 years ago
- Scanpath metrics in Python ☆31 Updated 4 years ago
- Visual Scanpath Prediction using IOR-ROI Recurrent Mixture Density Network ☆29 Updated 4 years ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023) ☆42 Updated last year
- Official Repository for ECCV 2020 paper "AiR: Attention with Reasoning Capability" ☆51 Updated 4 years ago
- ☆56 Updated 5 years ago
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h… ☆63 Updated 8 months ago
- Predicting Goal-directed Human Attention Using Inverse Reinforcement Learning (CVPR2020) ☆118 Updated 3 years ago
- Target-absent Human Attention (ECCV2022) ☆18 Updated 2 years ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22) ☆108 Updated last year
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images ☆22 Updated 2 years ago
- PathGan: Visual Scan-path Prediction with Generative Adversarial Networks ☆42 Updated 2 years ago
- Python Framework for Saliency Modeling and Evaluation ☆172 Updated 5 months ago
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention" ☆106 Updated 2 years ago
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023) ☆15 Updated 10 months ago
- Code for evaluating models in the MIT/Tuebingen saliency benchmark ☆27 Updated last year
- Implementation of Graph Based Visual Saliency algorithm by J. Harel, C. Koch, and P. Perona ☆56 Updated 6 years ago
- Beyond Universal Saliency: Personalized Saliency Prediction with Multi-task CNN (IJCAI 2017 and TPAMI) ☆11 Updated 6 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021) ☆35 Updated 4 years ago
- [ECCV 2024] Prompting Language-Informed Distribution for Compositional Zero-Shot Learning ☆15 Updated last year
- The official project for the paper: Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation, CVPR 2022 ☆14 Updated 3 years ago
- Official repository for CVPR 2024 paper "Advancing Saliency Ranking with Human Fixations: Dataset, Models and Benchmarks". ☆19 Updated last year
- ☆31 Updated 2 years ago
- HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos ☆22 Updated last year
- Code and instructions accompanying ICCV'23 paper Prototype-based Dataset Comparison ☆18 Updated 2 years ago
- ☆48 Updated 2 years ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning ☆73 Updated 3 years ago
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman. ☆119 Updated 2 years ago
- [CVPR'21] Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild ☆49 Updated 3 years ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022 ☆35 Updated 2 years ago