Arhosseini77 / SUM
[WACV2025] SUM: Saliency Unification through Mamba for Visual Attention Modeling
☆54Updated last month
Alternatives and similar repositories for SUM:
Users that are interested in SUM are comparing it to the libraries listed below
- Official repository for the paper "TempSAL - Uncovering Temporal Information for Deep Saliency Prediction" (CVPR 2023)☆12Updated 2 months ago
- Deep Generative Models, University of Tehran, Dr.Tavassolipour☆13Updated 11 months ago
- Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis☆28Updated 5 months ago
- [ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation☆99Updated last year
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆21Updated 7 months ago
- 🔥 Pytorch implementation for a feedback saliency detection model (SalFBNet)☆19Updated 7 months ago
- Ensemble Neural Representation Networks☆11Updated 3 years ago
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆44Updated last month
- A human-computer interaction system that combines eye tracking with Segment Anything Model (SAM), and it enables users to segment object …☆56Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆99Updated last month
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆97Updated 8 months ago
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆54Updated 6 months ago
- Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23☆26Updated 3 weeks ago
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆32Updated 8 months ago
- ☆19Updated last year
- ☆22Updated 3 months ago
- ☆16Updated 2 years ago
- The pytorch implementation of STSANet (non-official)☆9Updated last year
- ☆48Updated 6 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆118Updated 3 weeks ago
- code for paper: Simultaneous Image to Zero and Zero to Noise: Diffusion Models with Analytical Image Attenuation☆43Updated 8 months ago
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]☆66Updated 11 months ago
- [WACV 2024] INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings☆41Updated this week
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆81Updated 5 months ago
- A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch☆27Updated 11 months ago
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆40Updated last month
- ☆33Updated 11 months ago
- ☆52Updated last year
- ☆34Updated 11 months ago
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.☆90Updated 9 months ago