Arhosseini77 / SUM
[WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling
☆65Updated 2 weeks ago
Alternatives and similar repositories for SUM:
Users that are interested in SUM are comparing it to the libraries listed below
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆13Updated last month
- Deep Generative Models, University of Tehran, Dr.Tavassolipour☆14Updated last year
- Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis☆29Updated 8 months ago
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆50Updated 4 months ago
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆22Updated 11 months ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆33Updated last year
- [IEEE TCSVT] Vivim: a Video Vision Mamba for Medical Video Segmentation☆169Updated this week
- Official code for NeurIPS 2023 paper "Self-Supervised Motion Magnification by Backpropagating Through Optical Flow"☆33Updated last year
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Updated 3 months ago
- ☆31Updated last year
- [WACV 2024] INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings☆42Updated 3 months ago
- Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024☆12Updated 10 months ago
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis☆187Updated 9 months ago
- A human-computer interaction system that combines eye tracking with Segment Anything Model (SAM), and it enables users to segment object …☆61Updated last year
- The pytorch implementation of STSANet (non-official)☆10Updated 2 years ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆74Updated last year
- This is the official code release for our work, Denoising Vision Transformers.☆361Updated 5 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆75Updated last year
- LiVOS: Light Video Object Segmentation with Gated Linear Matching (CVPR 2025)☆34Updated last week
- An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers☆56Updated last year
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆97Updated 11 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR -2025]☆92Updated last month
- [ACMMM Oral, 2023] "Towards Explainable In-the-wild Video Quality Assessment: A Database and a Language-Prompted Approach"☆77Updated 8 months ago
- [2023-CVPR] ScanDMM: A Deep Markov Model of Scanpath Prediction for 360-degree Images☆19Updated last year
- SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution☆117Updated last year
- ☆65Updated 6 months ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆75Updated 10 months ago
- 2D discrete Wavelet Transform for Image Classification and Segmentation☆87Updated 3 months ago
- This repo is the official implementation of iSeg: An Iterative Refinement-based Framework for Training-free Segmentation.☆36Updated 5 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆122Updated 4 months ago