cvlab-stonybrook / Gazeformer
Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)
☆36 Updated last year
Alternatives and similar repositories for Gazeformer
Users who are interested in Gazeformer are comparing it to the repositories listed below.
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h… ☆55 Updated 3 months ago
- Official code for the paper "Predicting Human Scanpaths in Visual Question Answering" ☆22 Updated 4 years ago
- Official repo of the paper "Object-aware Gaze Target Detection" (ICCV 2023) ☆41 Updated 8 months ago
- Scanpath metrics in Python ☆30 Updated 4 years ago
- CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers" ☆20 Updated last month
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images ☆21 Updated 2 years ago
- Integrating Human Gaze into Attention for Egocentric Activity Recognition (WACV 2021) ☆25 Updated 2 years ago
- [CVPR2022] MS-TCT ☆55 Updated 2 years ago
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023) ☆14 Updated 5 months ago
- ☆11 Updated 5 months ago
- The pytorch implementation of STSANet (non-official) ☆11 Updated 2 years ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for… ☆27 Updated 5 months ago
- Official Repository for ECCV 2020 paper "AiR: Attention with Reasoning Capability" ☆50 Updated 4 years ago
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points ☆46 Updated last year
- Pytorch code for Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning, CVPR2022. ☆91 Updated 2 years ago
- 🔥 Pytorch implementation for a feedback saliency detection model (SalFBNet) ☆20 Updated 6 months ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22) ☆108 Updated last year
- FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment ☆140 Updated 11 months ago
- [TIP 2022] End-to-end Temporal Action Detection with Transformer ☆153 Updated 2 years ago
- [CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection ☆85 Updated 2 years ago
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization ☆58 Updated 2 years ago
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation? ☆49 Updated last year
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022] ☆38 Updated last year
- Code for Diffusion Action Segmentation (ICCV 2023) ☆64 Updated last year
- ☆37 Updated 3 years ago
- Target-absent Human Attention (ECCV2022) ☆18 Updated 2 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models. ☆86 Updated 3 years ago
- Official Implementation of the paper "Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Transl… ☆38 Updated 2 years ago
- [ICCV2023] AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception ☆43 Updated last year
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning" ☆92 Updated 10 months ago