Arhosseini77 / SUM
[WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling
☆63Updated last month
Alternatives and similar repositories for SUM:
Users interested in SUM are comparing it to the repositories listed below
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆13Updated 3 weeks ago
- Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis☆29Updated 7 months ago
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆49Updated 4 months ago
- Deep Generative Models, University of Tehran, Dr. Tavassolipour☆14Updated last year
- Code for "Saliency Prediction of Sports Videos: A Large-Scale Database and a Self-Adaptive Approach", ICASSP 2024☆12Updated 10 months ago
- A human-computer interaction system that combines eye tracking with Segment Anything Model (SAM), and it enables users to segment object …☆60Updated last year
- Official implementation of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆21Updated 10 months ago
- List of papers related to State Space Models (Mamba) in Vision.☆39Updated 9 months ago
- Vivim: a Video Vision Mamba for Medical Video Segmentation☆167Updated 5 months ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆96Updated 11 months ago
- [AAAI2025] Official Implementation of "FOCUS: Towards Universal Foreground Segmentation"☆24Updated 2 months ago
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation☆40Updated 6 months ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆33Updated last year
- Unofficial PyTorch implementation of STSANet☆10Updated 2 years ago
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]☆79Updated last year
- Code for "MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation"☆28Updated last year
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)☆343Updated 7 months ago
- WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in co…☆95Updated 7 months ago
- Implementation of paper 'Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models'☆67Updated last year
- [WACV 2024] INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings☆42Updated 2 months ago
- [CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"☆196Updated 10 months ago
- [CVPR 2024 Highlight] Official PyTorch implementation of "MindBridge: A Cross-Subject Brain Decoding Framework"☆95Updated 3 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆54Updated last month
- SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution☆114Updated last year
- This is the official code release for our work, Denoising Vision Transformers.☆357Updated 4 months ago
- Fake It Till You Make It: Near-Distribution Novelty Detection by Score-Based Generative Models☆24Updated 2 years ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆120Updated 3 months ago
- GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [CVPR 2025]☆84Updated last week
- PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes☆60Updated 10 months ago