cvlab-stonybrook / HATLinks
CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers"
☆20Updated last month
Alternatives and similar repositories for HAT
Users that are interested in HAT are comparing it to the libraries listed below
Sorting:
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering"☆22Updated 4 years ago
- Scanpath metrics in Python☆30Updated 4 years ago
- Official Repository for ECCV 2020 paper "AiR: Attention with Reasoning Capability"☆50Updated 4 years ago
- Visual Scanpath Prediction using IOR-ROI Recurrent Mixture Density Network☆28Updated 4 years ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆36Updated last year
- ☆56Updated 4 years ago
- Implementation of Graph Based Visual Saliency algorithm by J. Harel, C. Koch, and P. Perona☆53Updated 5 years ago
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆55Updated 3 months ago
- Predicting Goal-directed Human Attention Using Inverse Reinforcement Learning (CVPR2020)☆110Updated 2 years ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images☆21Updated 2 years ago
- Python Framework for Saliency Modeling and Evaluation☆165Updated 2 weeks ago
- PathGan: Visual Scan-path Prediction with Generative Adversarial Networks☆40Updated 2 years ago
- ☆28Updated last year
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)☆108Updated last year
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆14Updated 5 months ago
- Temporal recurrences for video saliency prediction (BMVC 2019)☆6Updated 5 years ago
- Target-absent Human Attention (ECCV2022)☆18Updated 2 years ago
- Pytorch Implementation of paper "Object-Centric Learning with Slot Attention"☆97Updated last year
- Code for evaluating models in the MIT/Tuebingen saliency benchmark☆27Updated 7 months ago
- ☆11Updated 5 months ago
- Beyond Universal Saliency: Personalized Saliency Prediction with Multi-task CNN (IJCAI 2017 and TPAMI)☆11Updated 6 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)☆34Updated 4 years ago
- When CNNs Meet Random RNNs: Towards Multi-Level Analysis for RGB-D Object and Scene Recognition (CVIU 2022)☆13Updated 2 years ago
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?☆49Updated last year
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆79Updated last year
- ☆71Updated last year
- ☆23Updated 2 years ago
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.☆118Updated last year
- PyTorch Implementation for "Asymmetric Cross-Guided Attention Network for Actor and Action Video Segmentation From Natural Language Quer…☆20Updated 4 years ago
- [ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆165Updated 4 years ago