IgnatPolezhaev / MDS-ViTNetLinks
We present a novel methodology we call MDS-ViTNet (Multi Decoder Saliency by Vision Transformer Network) for enhancing visual saliency prediction or eye-tracking. Our trained model achieves state-of-the-art results across several benchmarks.
☆16Updated 4 months ago
Alternatives and similar repositories for MDS-ViTNet
Users that are interested in MDS-ViTNet are comparing it to the libraries listed below
Sorting:
- Official code and dataset of MVFormer☆9Updated last year
- The pytorch implementation of STSANet (non-official)☆11Updated 2 years ago
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆14Updated 2 months ago
- ☆22Updated 2 years ago
- TranSalNet: Towards perceptually relevant visual saliency prediction. Neurocomputing (2022)☆57Updated 10 months ago
- Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024☆12Updated last year
- Code for evaluating models in the MIT/Tuebingen saliency benchmark☆26Updated 5 months ago
- [WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling☆69Updated last month
- The official PyTorch implementation for paper "Hierarchical Domain-Adapted Feature Learning for Video Saliency Prediction"☆24Updated 2 years ago
- ☆56Updated 4 years ago
- [Pattern Recognition]Video Saliency Prediction using Enhanced Spatiotemporal Alignment Network☆25Updated 4 years ago
- pytorch implementation of the different DeepGaze models☆144Updated last year
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆22Updated last year
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆53Updated 3 weeks ago
- Unified Image and Video Saliency Modeling (ECCV 2020)☆142Updated 10 months ago
- ☆19Updated 2 years ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆36Updated last year
- The large-scale eye-tracking database called LEDOV for video salinecy☆18Updated 5 years ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images☆20Updated 2 years ago
- CVPR 2024 "Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers"☆19Updated 8 months ago
- ☆21Updated 3 years ago
- STRAL-Net☆23Updated 6 months ago
- Saliency prediction on 360° image with SalGAN☆16Updated 4 years ago
- Python Framework for Saliency Modeling and Evaluation☆161Updated this week
- Implementation of A Deep Multi-Level Network for Saliency Prediction in Pytorch☆31Updated 6 years ago
- Pytorch version of Saliency Attentive Model (SAM)☆9Updated 6 years ago
- ViNet Pushing the limits of Visual Modality for Audio Visual Saliency Prediction☆67Updated 2 years ago
- Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)☆141Updated 2 years ago
- Official PyTorch implementation of our paper "Spherical Vision Transformer for 360° Video Saliency Prediction" (BMVC 2023)☆16Updated last year
- ☆27Updated last year