zhouchenlin2096 / Awesome-Transformer-for-Vision-RecognitionLinks
A comprehensive paper list of Transformer & Attention for Vision Recognition / Foundation Model, including papers, codes, and related websites.
☆20Updated 2 years ago
Alternatives and similar repositories for Awesome-Transformer-for-Vision-Recognition
Users that are interested in Awesome-Transformer-for-Vision-Recognition are comparing it to the libraries listed below
Sorting:
- [NeurIPS2023]Lightweight Vision Transformer with Bidirectional Interaction☆27Updated 2 years ago
- Source code for AAAI 2025 paper: FSTA-SNN:Frequency-based Spatial-Temporal Attention Module for Spiking Neural Networks☆48Updated 10 months ago
- ☆69Updated last year
- Trainable Highly-expressive Activation Functions. ECCV 2024☆38Updated 10 months ago
- ☆44Updated 9 months ago
- [ECCV-24] Spiking Wavelet Transformer☆39Updated 5 months ago
- Official code release of our paper "EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention"☆21Updated 4 months ago
- PyTorch code for Diffusion Mechanism in Neural Network: Theory and Applications☆40Updated last year
- Official code release of our paper "FViT: A Focal Vision Transformer with Gabor Filter"☆18Updated 4 months ago
- Orthogonal Channel Attentions Networks☆53Updated 2 years ago
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆79Updated last year
- ☆152Updated last year
- PyTorch Implementation of Spiking Transformer with Spatial-Temporal Attention (CVPR 2025)☆64Updated 6 months ago
- Code for the paper: "FusionMamba: Efficient Image Fusion with State Space Model", TGRS, 2024.☆130Updated 10 months ago
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆83Updated 2 months ago
- Official implementation for "Deep Fractional Fourier Transform " [NeurIPS 2023]☆43Updated last year
- Code of "SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising"☆51Updated 7 months ago
- Pan-Mamba: Effective Pan-Sharpening with State Space Model☆131Updated last year
- Ofiicial Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data☆65Updated last year
- Scattering Vision Transformer☆53Updated last year
- ☆66Updated last year
- ☆54Updated 8 months ago
- [ICCV2025] Official Pytorch Implementation of TinyViM☆103Updated 6 months ago
- Vision Mamba: A Comprehensive Survey and Taxonomy☆98Updated last year
- ☆85Updated 2 years ago
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆187Updated 10 months ago
- ☆26Updated last year
- The official repository of the paper "DeepM2CDL: Deep Multi-scale Multi-modal Convolutional Dictionary Learning Network" from IEEE Transa…☆54Updated last year
- [ACCV 2024 ] Official code for "DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention"☆31Updated 11 months ago
- Official Pytorch implementation of " Are Vision xLSTM Embedded UNet More Reliable in Medical 3D Image Segmentation? "☆60Updated 5 months ago