Arhosseini77 / SUMLinks
[WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling
☆82Updated 2 months ago
Alternatives and similar repositories for SUM
Users that are interested in SUM are comparing it to the libraries listed below
Sorting:
- [IEEE TCSVT] Vivim: a Video Vision Mamba for Medical Video Segmentation☆181Updated 4 months ago
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆14Updated 7 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆123Updated 7 months ago
- [ICCV2025] Introduce Mamba2 to Vision.☆168Updated this week
- Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)☆128Updated 7 months ago
- Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024☆13Updated last year
- Mamba in Vision: A Comprehensive Survey of Techniques and Applications☆127Updated last year
- 2D discrete Wavelet Transform for Image Classification and Segmentation☆91Updated 9 months ago
- WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in co…☆98Updated last year
- ☆152Updated last year
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]☆98Updated last year
- List of papers related to State Space Models (Mamba) in Vision.☆38Updated last year
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"☆220Updated last month
- Ofiicial Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data☆64Updated last year
- This is the official code release for our work, Denoising Vision Transformers.☆383Updated 11 months ago
- A curated list of awesome resources for salient object detection (SOD), focusing more on multi-modal SOD, such as RGB-D SOD.☆134Updated last year
- [CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"☆51Updated 4 months ago
- ☆85Updated 2 years ago
- Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran☆11Updated last year
- SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution☆129Updated last year
- Code Implementation of EfficientVMamba☆231Updated last year
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆86Updated 5 months ago
- ☆75Updated 8 months ago
- ☆148Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆122Updated 6 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆78Updated 2 years ago
- Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan☆268Updated last year
- ☆238Updated last year
- Official ImageNet Model repository☆256Updated 2 years ago
- [WACV 2025] Python implementation of Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation☆262Updated last month