Arhosseini77 / SUMLinks
[WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling
☆70Updated 2 months ago
Alternatives and similar repositories for SUM
Users that are interested in SUM are comparing it to the libraries listed below
Sorting:
- Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran☆11Updated last year
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆14Updated 3 months ago
- Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis☆30Updated last month
- Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024☆14Updated last year
- Deep Generative Models, University of Tehran, Dr.Tavassolipour☆16Updated last year
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆22Updated last year
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆54Updated last month
- The pytorch implementation of STSANet (non-official)☆11Updated 2 years ago
- We present a novel methodology we call MDS-ViTNet (Multi Decoder Saliency by Vision Transformer Network) for enhancing visual saliency pr…☆16Updated 5 months ago
- WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in co…☆96Updated 9 months ago
- List of papers related to State Space Models (Mamba) in Vision.☆38Updated 11 months ago
- Official code and dataset of MVFormer☆8Updated last year
- SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution☆122Updated last year
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆36Updated last year
- Code of "LMLT: Low-to-high Multi-Level Vision Transformer for Image Super-Resolution".☆23Updated 8 months ago
- [IEEE TCSVT] Vivim: a Video Vision Mamba for Medical Video Segmentation☆175Updated 2 weeks ago
- [ICLR'25] AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation☆168Updated last month
- TranSalNet: Towards perceptually relevant visual saliency prediction. Neurocomputing (2022)☆57Updated 11 months ago
- ☆19Updated 2 years ago
- Database of "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020☆12Updated 3 years ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images☆20Updated 2 years ago
- HINT: High-quality INpainting Transformer with Enhanced Attention and Mask-aware Encoding☆40Updated 5 months ago
- [WACV 2024] INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings☆44Updated 5 months ago
- ☆22Updated 2 years ago
- ☆50Updated last week
- [ACM MM 2023] Mask-Guided Progressive Network for Joint Raindrop and Rain Streak Removal in Videos☆14Updated 11 months ago
- High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity (ICLR2025)☆30Updated last month
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024☆66Updated last year
- 🔥 Pytorch implementation for a feedback saliency detection model (SalFBNet)☆19Updated 4 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆100Updated 3 months ago