zhao-chunyu / SaliencyMambaLinks
[AAAI’2025] SalM²: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention
☆44Updated last month
Alternatives and similar repositories for SaliencyMamba
Users that are interested in SaliencyMamba are comparing it to the libraries listed below
Sorting:
- Quality-aware multimodal fusion on ICML 2023☆111Updated 3 months ago
- Deep Correlated Prompting for Visual Recognition with Missing Modalities (NeurIPS 2024)☆29Updated 7 months ago
- A list of papers, codes and applications on multi-task learning.☆76Updated last week
- Code for dmrnet☆28Updated 2 months ago
- VadCLIP official Pytorch implementation☆177Updated last year
- GroupMixAttention and GroupMixFormer☆116Updated last year
- This is official github repo for InReview paper "MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Seg…☆14Updated 5 months ago
- This is the offical repository for "Multi-modal Gated Mixture of Local-to-Global Experts for Dynamic Image Fusion" (ICCV 2023).☆69Updated last year
- A novel cross-modal decoupling and alignment framework for multimodal representation learning.☆34Updated 6 months ago
- [ICML 2024] Official implementation for "Predictive Dynamic Fusion."☆64Updated 8 months ago
- ☆152Updated last year
- ☆85Updated 2 years ago
- Implementation for paper "Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Model"☆93Updated 9 months ago
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆31Updated last year
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer"☆90Updated 2 years ago
- [AAAI 2025] Enhance Vision-Language Alignment with Noise☆24Updated 9 months ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆46Updated last year
- visualization:filter、feature map、attention map、image-mask、grad-cam、human keypoint、guided-backpro☆130Updated 2 years ago
- arxiv-daily☆82Updated 4 years ago
- [ICCV2023] AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception☆44Updated last year
- ReViT - Residual Attention Vision Transformer☆33Updated last year
- ☆147Updated last year
- A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection☆18Updated last year
- The official repository implement of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with…☆77Updated 2 months ago
- MAE for CIFAR,由于可用资源有限,我们仅在 cifar10 上测试模型。我们主要想重现这样的结果:使用 MAE 预训练 ViT 可以比直接使用标签进行监督学习训练获得更好的结果。这应该是自我监督学习比监督学习更有效的数据的证据。☆78Updated 2 years ago
- We proposed a large-scale benchmark for traffic accidents detection from video surveillance☆14Updated 9 months ago
- Code and Dataset for the paper "LAMM: Label Alignment for Multi-Modal Prompt Learning" AAAI 2024☆33Updated last year
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆41Updated 2 years ago
- Code for the paper 'Dynamic Multimodal Fusion'☆114Updated 2 years ago
- ☆10Updated 2 years ago