Arhosseini77 / SUMLinks
[WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling
☆80Updated last week
Alternatives and similar repositories for SUM
Users that are interested in SUM are comparing it to the libraries listed below
Sorting:
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆14Updated 5 months ago
- Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024☆13Updated last year
- WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in co…☆96Updated last year
- [IEEE TCSVT] Vivim: a Video Vision Mamba for Medical Video Segmentation☆177Updated 2 months ago
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆56Updated 3 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆119Updated 5 months ago
- Description: Frequency Augmented Variational Autoencoder for better Image Reconstruction☆41Updated last year
- List of papers related to State Space Models (Mamba) in Vision.☆38Updated last year
- Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)☆122Updated 5 months ago
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]☆95Updated last year
- This repository contains the pytorch code for our work IEEE ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training…☆81Updated 10 months ago
- Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran☆11Updated last year
- [CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"☆48Updated 2 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆77Updated 2 years ago
- SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution☆129Updated last year
- Ofiicial Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data☆63Updated last year
- ☆85Updated 2 years ago
- official repo for the paper "EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata"☆47Updated last year
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆84Updated 3 months ago
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"☆218Updated last year
- 2D discrete Wavelet Transform for Image Classification and Segmentation☆91Updated 7 months ago
- ☆72Updated 6 months ago
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆100Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆117Updated 4 months ago
- [CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"☆213Updated last year
- A curated list of awesome resources for salient object detection (SOD), focusing more on multi-modal SOD, such as RGB-D SOD.☆129Updated 11 months ago
- [CVPR 2024] LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion.☆46Updated 7 months ago
- ☆42Updated 3 months ago
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis☆206Updated last year
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation☆61Updated 8 months ago