Arhosseini77 / SUMLinks
[WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling
☆82Updated last month
Alternatives and similar repositories for SUM
Users that are interested in SUM are comparing it to the libraries listed below
Sorting:
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆14Updated 6 months ago
- Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024☆13Updated last year
- [IEEE TCSVT] Vivim: a Video Vision Mamba for Medical Video Segmentation☆179Updated 3 months ago
- Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)☆122Updated 6 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆121Updated 6 months ago
- [ICCV2025] Introduce Mamba2 to Vision.☆159Updated 3 months ago
- WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in co…☆98Updated last year
- 2D discrete Wavelet Transform for Image Classification and Segmentation☆91Updated 8 months ago
- ☆137Updated last year
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]☆96Updated last year
- ☆42Updated 4 months ago
- SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution☆128Updated last year
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention"☆221Updated this week
- unofficial implementation of DiffMAE☆15Updated last year
- This is the official code release for our work, Denoising Vision Transformers.☆380Updated 10 months ago
- ☆22Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆78Updated 2 years ago
- Official code for NeurIPS 2023 paper "Self-Supervised Motion Magnification by Backpropagating Through Optical Flow"☆37Updated last year
- [ECCV2024 - Oral] Adaptive Parametric Activation☆53Updated 6 months ago
- DiffSeg is an unsupervised zero-shot segmentation method using attention information from a stable-diffusion model. This repo implements …☆321Updated last year
- This repository contains the pytorch code for our work IEEE ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training…☆84Updated 11 months ago
- ☆73Updated 7 months ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆100Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆120Updated 5 months ago
- HINT: High-quality INpainting Transformer with Enhanced Attention and Mask-aware Encoding☆48Updated 8 months ago
- [CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"☆51Updated 3 months ago
- [WACV 2024 Oral] - ARNIQA: Learning Distortion Manifold for Image Quality Assessment☆145Updated last month
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆22Updated last year
- ☆85Updated 2 years ago
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆56Updated 4 months ago