lucidrains / axial-attention
Implementation of Axial attention - attending to multi-dimensional data efficiently
☆372Updated 3 years ago
Alternatives and similar repositories for axial-attention:
Users that are interested in axial-attention are comparing it to the libraries listed below
- An All-MLP solution for Vision, from Google AI☆1,015Updated 6 months ago
- This is a PyTorch re-implementation of Axial-DeepLab (ECCV 2020 Spotlight)☆451Updated 3 years ago
- Tiny PyTorch library for maintaining a moving average of a collection of parameters.☆426Updated 5 months ago
- An implementation of the efficient attention module.☆304Updated 4 years ago
- Self-supervised vIsion Transformer (SiT)☆327Updated 2 years ago
- An implementation of local windowed attention for language modeling☆428Updated 2 months ago
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision☆217Updated 3 years ago
- Implementation of Linformer for Pytorch☆274Updated last year
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification☆471Updated last year
- Implementation of a U-net complete with efficient attention as well as the latest research findings☆275Updated 10 months ago
- Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.☆492Updated last year
- Learning Rate Warmup in PyTorch☆404Updated last week
- Implementation of Pixel-level Contrastive Learning, proposed in the paper "Propagate Yourself", in Pytorch☆258Updated 4 years ago
- Transformer based on a variant of attention that is linear complexity in respect to sequence length☆751Updated 10 months ago
- This is an official implementation for "Self-Supervised Learning with Swin Transformers".☆648Updated 3 years ago
- Code for the Convolutional Vision Transformer (ConViT)☆466Updated 3 years ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆1,132Updated last year
- PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers☆226Updated 3 years ago
- A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model☆565Updated 3 months ago
- Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022☆1,096Updated 10 months ago
- Fully featured implementation of Routing Transformer☆289Updated 3 years ago
- Implementation of ConvMixer for "Patches Are All You Need? 🤷"☆1,070Updated 2 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆285Updated 2 years ago
- Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"☆360Updated last year
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆183Updated 2 years ago
- Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc…☆304Updated 3 years ago
- ☆245Updated 3 years ago
- Gradually-Warmup Learning Rate Scheduler for PyTorch☆988Updated 5 months ago
- PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)☆1,321Updated 9 months ago
- This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.☆573Updated last year