Arhosseini77 / SUM
[WACV2025] SUM: Saliency Unification through Mamba for Visual Attention Modeling
☆33Updated 3 weeks ago
Related projects: ⓘ
- Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran☆11Updated 7 months ago
- Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis☆26Updated last month
- This repository contains a curated list of research papers and resources focusing on saliency and scanpath prediction, human attention, h…☆34Updated last week
- [CVPR 2024 Highlight] Official PyTorch implementation of "MindBridge: A Cross-Subject Brain Decoding Framework"☆66Updated 3 weeks ago
- Official codebase for "Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention" (CVPR 2023)☆25Updated 6 months ago
- A human-computer interaction system that combines eye tracking with Segment Anything Model (SAM), and it enables users to segment object …☆49Updated last year
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆18Updated 3 months ago
- [NeurIPS2022] Mind Reader: Reconstructing complex images from brain activities☆59Updated last year
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆84Updated 4 months ago
- Official code for NeurIPS 2023 paper "Self-Supervised Motion Magnification by Backpropagating Through Optical Flow"☆23Updated 6 months ago
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation☆22Updated 3 months ago
- ☆15Updated last year
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆52Updated 5 months ago
- Codes for Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks☆10Updated 4 months ago
- [CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"☆25Updated 2 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆66Updated last year
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆55Updated 5 months ago
- [WACV 2024] INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings☆36Updated 2 months ago
- [ECCV 2024] PyTorch implementation of CropMAE, introduced in "Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders"☆43Updated 2 months ago
- PyTorch implementation of "Brain Decodes Deep Nets"☆50Updated 7 months ago
- ☆40Updated 3 months ago
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆86Updated last month
- Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation".☆19Updated 3 weeks ago
- ☆9Updated this week
- ☆37Updated 8 months ago
- ☆19Updated this week
- [CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.☆20Updated 4 months ago
- ☆14Updated 9 months ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆101Updated last year
- ECCV 2024 paper template☆47Updated 7 months ago