IgnatPolezhaev / MDS-ViTNet

We present a novel methodology we call MDS-ViTNet (Multi Decoder Saliency by Vision Transformer Network) for enhancing visual saliency prediction or eye-tracking. Our trained model achieves state-of-the-art results across several benchmarks.
15Updated 4 months ago

Related projects

Alternatives and complementary repositories for MDS-ViTNet