We present a novel methodology we call MDS-ViTNet (Multi Decoder Saliency by Vision Transformer Network) for enhancing visual saliency prediction or eye-tracking. Our trained model achieves state-of-the-art results across several benchmarks.
☆17Jan 18, 2025Updated last year
Alternatives and similar repositories for MDS-ViTNet
Users that are interested in MDS-ViTNet are comparing it to the libraries listed below
Sorting:
- TranSalNet: Towards perceptually relevant visual saliency prediction. Neurocomputing (2022)☆62Jul 25, 2024Updated last year
- The pytorch implementation of STSANet (non-official)☆11Feb 14, 2023Updated 3 years ago
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆29May 26, 2024Updated last year
- Code for evaluating models in the MIT/Tuebingen saliency benchmark☆28Jan 7, 2025Updated last year
- ☆11Mar 11, 2024Updated last year
- [WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling☆89Aug 23, 2025Updated 6 months ago
- an online variant of AVrateNG☆14Mar 20, 2025Updated 11 months ago
- Code for SCOUT (Task- and Context Modulated Attention for driving) and extended annotations for DR(eye)VE, BDD-A, and LBW datasets☆14Oct 24, 2024Updated last year
- 首届中国心电智能大赛决赛阶段解决方案-公开版 比赛网址 http://mdi.ids.tsinghua.edu.cn/☆10Aug 21, 2019Updated 6 years ago
- Tumor Dynamic Neural-ODE☆10Jan 29, 2024Updated 2 years ago
- ☆12Jun 2, 2025Updated 9 months ago
- This project employs the GFPGAN algorithm to upscale and restore images. The tool leverages state-of-the-art deep learning models to enha…☆13Jun 10, 2024Updated last year
- Efficient Feature Extraction for High-resolution Video Frame Interpolation (BMVC 2022)☆13Aug 24, 2023Updated 2 years ago
- ☆12Sep 6, 2023Updated 2 years ago
- Official implementation of "Unified Diffusion Transformer for High-Fidelity Text-Aware Image Restoration"☆26Dec 22, 2025Updated 2 months ago
- Nvidia TensorRT implementation of AdderNet for edge deployment☆10Nov 19, 2020Updated 5 years ago
- Responsible Visual Editing☆15Jul 10, 2024Updated last year
- ☆11Jun 27, 2022Updated 3 years ago
- A Torch implementation of attribute compression module from CNeT paper (T-CSVT)☆14Apr 19, 2023Updated 2 years ago
- ☆12Jan 26, 2023Updated 3 years ago
- Predicting Ovarian Cancer Treatment Response in Histopathology using Hierarchical Vision Transformers and Multiple Instance Learning☆12Nov 29, 2023Updated 2 years ago
- This repo is the official implementation of "Accurate detection of ST-segment and J point deviation from 12-lead Holter ECG using deep ne…☆12May 28, 2022Updated 3 years ago
- A Webpage Saliency Prediction model via 2 staged Transfer Learning using FCN-16s architecture☆12Mar 4, 2020Updated 6 years ago
- Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024☆14May 28, 2024Updated last year
- U-Net TF2☆14Jun 13, 2020Updated 5 years ago
- ☆15Oct 19, 2021Updated 4 years ago
- ECG delineator network for ambulatory 2-lead recordings☆13Feb 14, 2020Updated 6 years ago
- Pytorch Implementation of "PoSNet: 4x video frame interpolation using position-specific flow"☆10Mar 23, 2020Updated 5 years ago
- CLEF: Clinically-Guided Contrastive Learning for Electrocardiogram Foundation Models☆37Dec 11, 2025Updated 2 months ago
- Official code for Sebica☆14Feb 6, 2025Updated last year
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆15Mar 11, 2025Updated 11 months ago
- Python programming examples using PLUX's API.☆24Oct 29, 2025Updated 4 months ago
- ☆52Nov 27, 2021Updated 4 years ago
- Fully atumated digitization of paper/PDF electrocardiograms☆16Jun 12, 2025Updated 8 months ago
- ☆11May 8, 2021Updated 4 years ago
- Automatic Metric for Evaluating Generated Videos☆33Dec 8, 2025Updated 2 months ago
- UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space☆40Oct 17, 2025Updated 4 months ago
- Code for TMM paper "Horizontal-to-Vertical Video Conversion"☆14Jun 22, 2021Updated 4 years ago
- [TCSVT 2025] Core codes for "SSP-IR: Semantic and Structure Priors for Diffusion-based Realistic Image Restoration"☆17Feb 14, 2025Updated last year