zhouchenlin2096 / Awesome-Transformer-for-Vision-RecognitionLinks
A comprehensive paper list of Transformer & Attention for Vision Recognition / Foundation Model, including papers, codes, and related websites.
☆18Updated 2 years ago
Alternatives and similar repositories for Awesome-Transformer-for-Vision-Recognition
Users that are interested in Awesome-Transformer-for-Vision-Recognition are comparing it to the libraries listed below
Sorting:
- Source code for AAAI 2025 paper: FSTA-SNN:Frequency-based Spatial-Temporal Attention Module for Spiking Neural Networks☆29Updated 5 months ago
- [NeurIPS2023]Lightweight Vision Transformer with Bidirectional Interaction☆24Updated last year
- Codes of the paper: SpikingResformer: Bridging ResNet and Vision Transformer in Spiking Neural Networks (CVPR2024)☆61Updated 7 months ago
- ☆67Updated last year
- Official code release of our paper "EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention"☆21Updated 9 months ago
- PyTorch Implementation of Spiking Transformer with Spatial-Temporal Attention (CVPR 2025)☆36Updated 2 months ago
- ☆29Updated 4 months ago
- [ECCV-24] Spiking Wavelet Transformer☆29Updated 3 weeks ago
- Trainable Highly-expressive Activation Functions. ECCV 2024☆37Updated 5 months ago
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆107Updated 11 months ago
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆68Updated 2 months ago
- Code of "SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising"☆47Updated 2 months ago
- ☆148Updated 11 months ago
- Scattering Vision Transformer☆54Updated last year
- ☆32Updated last year
- Official implementation for "Deep Fractional Fourier Transform " [NeurIPS 2023]☆39Updated last year
- PyTorch code for Diffusion Mechanism in Neural Network: Theory and Applications☆40Updated last year
- Vision Mamba: A Comprehensive Survey and Taxonomy☆95Updated 11 months ago
- Official code release of our paper "FViT: A Focal Vision Transformer with Gabor Filter"☆18Updated last year
- ReViT - Residual Attention Vision Transformer☆32Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆76Updated 7 months ago
- Pan-Mamba: Effective Pan-Sharpening with State Space Model☆120Updated last year
- Offical code of "QKFormer: Hierarchical Spiking Transformer using Q-K Attention" (NeurIPS 2024,Spotlight 3%)☆124Updated 7 months ago
- The official repository of the paper "DeepM2CDL: Deep Multi-scale Multi-modal Convolutional Dictionary Learning Network" from IEEE Transa…☆50Updated last year
- Offical implementation of "Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation" (AAAI2025 Oral)☆20Updated 6 months ago
- Vision Mamba 2: More Efficient Visual Representation Learning with State Space Duality☆26Updated last year
- Orthogonal Channel Attentions Networks☆54Updated last year
- ☆50Updated 3 months ago
- Learning A Spiking Neural Network for Efficient Image Deraining (IJCAI 2024)☆63Updated last month
- Weighted Convolution 2.0☆44Updated 2 weeks ago