zhouchenlin2096 / Awesome-Transformer-for-Vision-Recognition
A comprehensive paper list of Transformer & Attention for Vision Recognition / Foundation Model, including papers, codes, and related websites.
☆17Updated last year
Alternatives and similar repositories for Awesome-Transformer-for-Vision-Recognition
Users that are interested in Awesome-Transformer-for-Vision-Recognition are comparing it to the libraries listed below
Sorting:
- [NeurIPS2023]Lightweight Vision Transformer with Bidirectional Interaction☆24Updated last year
- Source code for AAAI 2025 paper: FSTA-SNN:Frequency-based Spatial-Temporal Attention Module for Spiking Neural Networks☆20Updated 3 months ago
- Official code release of our paper "EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention"☆20Updated 7 months ago
- Codes of the paper: SpikingResformer: Bridging ResNet and Vision Transformer in Spiking Neural Networks (CVPR2024)☆57Updated 4 months ago
- ☆60Updated last year
- The official repository of the paper "DeepM2CDL: Deep Multi-scale Multi-modal Convolutional Dictionary Learning Network" from IEEE Transa…☆49Updated last year
- Scattering Vision Transformer☆50Updated last year
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆104Updated 8 months ago
- ☆24Updated last year
- ☆23Updated 2 months ago
- Orthogonal Channel Attentions Networks☆53Updated last year
- Official implementation of ParCNetV2☆10Updated last year
- Offical implementation of "Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation" (AAAI2025 Oral)☆18Updated 3 months ago
- Code of "SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising"☆43Updated this week
- Trainable Highly-expressive Activation Functions. ECCV 2024☆38Updated 2 months ago
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆62Updated this week
- [ECCV-24] Spiking Wavelet Transformer☆27Updated 4 months ago
- NTIRE Workshop and Challenges @ CVPR 2024☆37Updated 7 months ago
- [WACV 2024] Spiking Denoising Diffusion Probabilistic Models☆47Updated last month
- This project is based on Vim (paper, code) and we appreciate this excellent work.☆13Updated 4 months ago
- ☆13Updated last year
- PyTorch code for Diffusion Mechanism in Neural Network: Theory and Applications☆39Updated last year
- Official code release of our paper "FViT: A Focal Vision Transformer with Gabor Filter"☆16Updated last year
- Pan-Mamba: Effective Pan-Sharpening with State Space Model☆111Updated last year
- Official implementation for "Deep Fractional Fourier Transform " [NeurIPS 2023]☆36Updated last year
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆82Updated last year
- ☆21Updated 10 months ago
- The official implementation of the AAAI 2024 paper Bi-ViT.☆10Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆69Updated 4 months ago
- This repo is the relevant code for MAWNO☆17Updated last year