zhouchenlin2096 / Awesome-Transformer-for-Vision-RecognitionLinks
A comprehensive paper list of Transformer & Attention for Vision Recognition / Foundation Model, including papers, codes, and related websites.
☆18Updated last year
Alternatives and similar repositories for Awesome-Transformer-for-Vision-Recognition
Users that are interested in Awesome-Transformer-for-Vision-Recognition are comparing it to the libraries listed below
Sorting:
- Source code for AAAI 2025 paper: FSTA-SNN:Frequency-based Spatial-Temporal Attention Module for Spiking Neural Networks☆28Updated 5 months ago
- [NeurIPS2023]Lightweight Vision Transformer with Bidirectional Interaction☆24Updated last year
- ☆65Updated last year
- Official code release of our paper "EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention"☆21Updated 8 months ago
- Official implementation for "Deep Fractional Fourier Transform " [NeurIPS 2023]☆38Updated last year
- Orthogonal Channel Attentions Networks☆53Updated last year
- Official code release of our paper "FViT: A Focal Vision Transformer with Gabor Filter"☆17Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆75Updated 6 months ago
- Trainable Highly-expressive Activation Functions. ECCV 2024☆37Updated 4 months ago
- ☆147Updated 10 months ago
- Scattering Vision Transformer☆52Updated last year
- The official repository of the paper "DeepM2CDL: Deep Multi-scale Multi-modal Convolutional Dictionary Learning Network" from IEEE Transa…☆50Updated last year
- Pan-Mamba: Effective Pan-Sharpening with State Space Model☆116Updated last year
- Weighted Convolution 2.0☆36Updated last month
- Code for the paper: "FusionMamba: Efficient Image Fusion with State Space Model", TGRS, 2024.☆115Updated 4 months ago
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆67Updated 2 months ago
- [ECCV-24] Spiking Wavelet Transformer☆27Updated this week
- ☆85Updated last year
- Vision Mamba: A Comprehensive Survey and Taxonomy☆95Updated 10 months ago
- [ICCV2025] Official Pytorch Implementation of TinyViM☆56Updated 3 weeks ago
- This is the offical repository for "Multi-modal Gated Mixture of Local-to-Global Experts for Dynamic Image Fusion" (ICCV 2023).☆62Updated last year
- This project is based on Vim (paper, code) and we appreciate this excellent work.☆13Updated 6 months ago
- Code of "SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising"☆44Updated 2 months ago
- 🕹️The toy examples of Kolmogorov-Arnold Network (Get Started Quickly)☆75Updated last year
- FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba☆179Updated 3 months ago
- Ofiicial Implementation for Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data☆61Updated last year
- The official implementation for ALOFT (CVPR 2023).☆55Updated last year
- ☆48Updated 2 months ago
- AAAI 2022 (Official implementation of "pan-sharpening with customized transformer and invertible neural network")☆14Updated 2 years ago
- AFFNet-Unofficial Implementation☆15Updated last year