zhao-chunyu / SaliencyMamba
[AAAI’2025] SalM²: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention
☆37Updated last week
Alternatives and similar repositories for SaliencyMamba
Users who are interested in SaliencyMamba are comparing it to the repositories listed below.
- Quality-aware multimodal fusion on ICML 2023☆109Updated last month
- ECCV Workshop☆9Updated 2 years ago
- Code for dmrnet☆26Updated 3 weeks ago
- PyTorch Implementation of Deep Equilibrium Multimodal Fusion☆21Updated 2 years ago
- Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)☆25Updated 5 months ago
- ReViT - Residual Attention Vision Transformer☆32Updated last year
- A novel cross-modal decoupling and alignment framework for multimodal representation learning.☆28Updated 4 months ago
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆29Updated last year
- This is the official implementation of “Learn-to-Decompose: Cascaded Decomposition Network for Cross-Domain Few-Shot Facial Expression Re…☆17Updated 3 years ago
- Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model☆21Updated last year
- [AAAI 2025] Enhance Vision-Language Alignment with Noise☆24Updated 7 months ago
- Code for the paper 'Dynamic Multimodal Fusion'☆114Updated 2 years ago
- The official implementation for ALOFT (CVPR 2023).☆55Updated last year
- ☆47Updated 7 months ago
- GroupMixAttention and GroupMixFormer☆117Updated last year
- ☆20Updated 4 months ago
- The official repository implement of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with…☆71Updated 3 months ago
- Convolutional Initialization for Data-Efficient Vision Transformers☆16Updated last year
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆53Updated 9 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆44Updated last year
- Implementation of ViViT: A Video Vision Transformer - Zipping Coding Challenge☆32Updated 4 years ago
- Code for paper Trustworthy Multimodal Regression with Mixture of Normal-inverse Gamma Distributions.☆46Updated last year
- ☆85Updated last year
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆39Updated 2 years ago
- Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23☆212Updated last year
- [ICML 2024] Official implementation for "Predictive Dynamic Fusion."☆60Updated 7 months ago
- ☆145Updated last year
- [ECCV 2022] LAFF for Text-to-Video Retrieval☆45Updated last year
- visualization:filter、feature map、attention map、image-mask、grad-cam、human keypoint、guided-backpro☆126Updated 2 years ago
- This is an official implementation of our NeurIPS 2022 paper "Bridging the Gap Between Vision Transformers and Convolutional Neural Netwo…☆58Updated 2 years ago