Arhosseini77 / SUM
[WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling
☆59 Updated this week
Alternatives and similar repositories for SUM:
Users who are interested in SUM are comparing it to the repositories listed below.
- Analyse and Design Deep Neural Network, Dr. Kalhor, University of Tehran ☆11 Updated last year
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023) ☆13 Updated last week
- Deep Generative Models, University of Tehran, Dr. Tavassolipour ☆14 Updated last year
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h… ☆45 Updated 3 months ago
- Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis ☆28 Updated 6 months ago
- Official implementation of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction ☆21 Updated 9 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders ☆100 Updated 2 months ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea… ☆97 Updated 10 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision ☆120 Updated 2 months ago
- A human-computer interaction system that combines eye tracking with Segment Anything Model (SAM), and it enables users to segment object … ☆59 Updated last year
- ☆49 Updated 9 months ago
- ☆48 Updated 8 months ago
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer ☆62 Updated 2 months ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for… ☆22 Updated last week
- Vivim: a Video Vision Mamba for Medical Video Segmentation ☆162 Updated 4 months ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction ☆70 Updated 10 months ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023) ☆32 Updated 11 months ago
- List of papers related to State Space Models (Mamba) in Vision. ☆39 Updated 8 months ago
- Official repository of paper "Subobject-level Image Tokenization" ☆65 Updated 10 months ago
- WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in co… ☆95 Updated 6 months ago
- [CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction" ☆45 Updated 4 months ago
- ☆10 Updated last week
- ☆122 Updated 8 months ago
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models ☆36 Updated 11 months ago
- Datasets and Papers (with codes) discussed in "Deep Learning for Video Object Segmentation: A Review", Artificial Intelligence Review, 20… ☆51 Updated last year
- 1-shot image segmentation using Stable Diffusion ☆134 Updated 11 months ago
- ECCV 2024 paper template ☆50 Updated last year
- ☆28 Updated last year
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory ☆52 Updated this week
- [CVPR2023] Code for "Streaming Video Model" ☆78 Updated last year