IgnatPolezhaev / MDS-ViTNetLinks
We present a novel methodology we call MDS-ViTNet (Multi Decoder Saliency by Vision Transformer Network) for enhancing visual saliency prediction or eye-tracking. Our trained model achieves state-of-the-art results across several benchmarks.
☆17Updated last year
Alternatives and similar repositories for MDS-ViTNet
Users that are interested in MDS-ViTNet are comparing it to the libraries listed below
Sorting:
- The pytorch implementation of STSANet (non-official)☆11Updated 2 years ago
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆15Updated 11 months ago
- ☆26Updated 2 years ago
- TranSalNet: Towards perceptually relevant visual saliency prediction. Neurocomputing (2022)☆62Updated last year
- The official PyTorch implementation for paper "Hierarchical Domain-Adapted Feature Learning for Video Saliency Prediction"☆27Updated 2 years ago
- Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024☆14Updated last year
- [WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling☆88Updated 5 months ago
- ☆12Updated 3 years ago
- [Pattern Recognition]Video Saliency Prediction using Enhanced Spatiotemporal Alignment Network☆25Updated 4 years ago
- Unified Image and Video Saliency Modeling (ECCV 2020)☆152Updated last year
- ☆53Updated 5 months ago
- ECCV-AIM 2024 Challenge on Video Saliency Prediction☆31Updated last year
- Official PyTorch implementation of our paper "Spherical Vision Transformer for 360° Video Saliency Prediction" (BMVC 2023)☆21Updated last year
- ☆28Updated 2 years ago
- Code for evaluating models in the MIT/Tuebingen saliency benchmark☆28Updated last year
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆29Updated last year
- [CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolatio☆489Updated 2 years ago
- ☆12Updated 8 months ago
- Official implementation of IJCAI 2023 paper "Video Frame Interpolation with Densely Queried Bilateral Correlation".☆33Updated 2 years ago
- ☆21Updated 2 years ago
- The large-scale eye-tracking database called LEDOV for video salinecy☆19Updated 6 years ago
- Database of "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020☆13Updated 3 years ago
- ☆56Updated 5 years ago
- [NeurIPS'23] The official implementation of paper "Bitstream-corrupted Video Recovery: A Novel Benchmark Dataset and Method"☆42Updated 6 months ago
- ViNet Pushing the limits of Visual Modality for Audio Visual Saliency Prediction☆75Updated 6 months ago
- Contextual Encoder-Decoder Network for Visual Saliency Prediction [Neural Networks 2020]☆205Updated last year
- [CVPR2024] Dataset and Code of "CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement".☆14Updated last year
- Source codes for "Unsupervised Curriculum Domain Adaptation for No-Reference Video Quality Assessment"☆19Updated 4 years ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆43Updated last year
- This is the PyTorch implementation of paper: FSR (AAAI 2023 Oral).☆12Updated 2 years ago