zhouchenlin2096 / Awesome-Transformer-for-Vision-RecognitionLinks
A comprehensive paper list of Transformer & Attention for Vision Recognition / Foundation Model, including papers, codes, and related websites.
☆18Updated last year
Alternatives and similar repositories for Awesome-Transformer-for-Vision-Recognition
Users that are interested in Awesome-Transformer-for-Vision-Recognition are comparing it to the libraries listed below
Sorting:
- [NeurIPS2023]Lightweight Vision Transformer with Bidirectional Interaction☆24Updated last year
- Source code for AAAI 2025 paper: FSTA-SNN:Frequency-based Spatial-Temporal Attention Module for Spiking Neural Networks☆26Updated 4 months ago
- ☆64Updated last year
- Official code release of our paper "EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention"☆21Updated 8 months ago
- PyTorch code for Diffusion Mechanism in Neural Network: Theory and Applications☆39Updated last year
- Official implementation of ParCNetV2☆10Updated last year
- This project is based on Vim (paper, code) and we appreciate this excellent work.☆13Updated 5 months ago
- Official implementation of SPANet in ICCV2023☆23Updated last month
- Scattering Vision Transformer☆50Updated last year
- Official code release of our paper "FViT: A Focal Vision Transformer with Gabor Filter"☆16Updated last year
- ☆23Updated last year
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆64Updated last month
- [ACCV 2024 ] Official code for "DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention"☆31Updated 5 months ago
- Orthogonal Channel Attentions Networks☆53Updated last year
- ☆51Updated 8 months ago
- The official repository of the paper "DeepM2CDL: Deep Multi-scale Multi-modal Convolutional Dictionary Learning Network" from IEEE Transa…☆49Updated last year
- Trainable Highly-expressive Activation Functions. ECCV 2024☆38Updated 4 months ago
- Codes of the paper: SpikingResformer: Bridging ResNet and Vision Transformer in Spiking Neural Networks (CVPR2024)☆59Updated 6 months ago
- Code of "SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising"☆44Updated last month
- Vision Mamba 2: More Efficient Visual Representation Learning with State Space Duality☆26Updated last year
- Official implementation for "Deep Fractional Fourier Transform " [NeurIPS 2023]☆37Updated last year
- [ECCV-24] Spiking Wavelet Transformer☆27Updated 5 months ago
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆72Updated 6 months ago
- ☆49Updated 9 months ago
- NTIRE Workshop and Challenges @ CVPR 2024☆38Updated 9 months ago
- ☆24Updated 3 months ago
- ☆19Updated last year
- Offical implementation of "Quantized Spike-driven Transformer" (ICLR2025)☆24Updated 2 months ago
- The code of LadleNet and LadleNet+☆12Updated last year
- Official implementation of the IEEE TGRS'24 paper "Lost in UNet: Improving Infrared Small Target Detection by Underappreciated Local Feat…☆12Updated 2 months ago