cmsflash / efficient-attention
An implementation of the efficient attention module.
☆283 · Updated 3 years ago
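As a rough orientation for readers, the core idea of efficient attention is to apply softmax to queries and keys separately (over the feature axis and the token axis, respectively) and then associate the matrix product as Q(KᵀV), so the quadratic (n × n) attention map is never formed. Below is a minimal NumPy sketch of that factorization; the function names and shapes are illustrative, not this repo's actual API:

```python
import numpy as np

def softmax(x, axis):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def efficient_attention(q, k, v):
    # q, k: (n, d_k); v: (n, d_v). Illustrative single-head sketch.
    q = softmax(q, axis=-1)   # normalize each query over the feature axis
    k = softmax(k, axis=0)    # normalize keys over the token axis
    context = k.T @ v         # (d_k, d_v) global context, O(n * d_k * d_v)
    return q @ context        # (n, d_v); the (n, n) map is never materialized
```

Because the (d_k, d_v) context matrix is computed first, the cost grows linearly with sequence length n rather than quadratically, which is what distinguishes this family of methods from standard dot-product attention.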
Related projects
Alternatives and complementary repositories for efficient-attention
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision ☆209 · Updated 3 years ago
- Implementation of Transformer in Transformer, pixel-level attention paired with patch-level attention for image classification, in Pytorch ☆300 · Updated 2 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction" ☆280 · Updated 2 years ago
- Implementation of Axial Attention, attending to multi-dimensional data efficiently ☆351 · Updated 3 years ago
- Implementation of Pixel-level Contrastive Learning, proposed in the paper "Propagate Yourself", in Pytorch ☆252 · Updated 3 years ago
- Implementation of Linformer for Pytorch ☆255 · Updated 10 months ago
- Implementation of the 😇 Attention layer from the paper "Scaling Local Self-Attention for Parameter Efficient Visual Backbones" ☆199 · Updated 3 years ago
- [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers" ☆544 · Updated 2 years ago
- Fully featured implementation of Routing Transformer ☆284 · Updated 3 years ago
- Official PyTorch implementation of Long-Short Transformer (NeurIPS 2021) ☆222 · Updated 2 years ago
- Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021 Oral ☆546 · Updated 10 months ago
- [ICLR 2022] Official implementation of cosformer-attention in "cosFormer: Rethinking Softmax in Attention" ☆179 · Updated last year
- MLP-like Vision Permutator for visual recognition (PyTorch) ☆190 · Updated 2 years ago
- Implementation of ConvMixer for "Patches Are All You Need? 🤷" ☆1,062 · Updated last year
- Transformer based on a variant of attention that is linear in complexity with respect to sequence length ☆695 · Updated 6 months ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" (https://arxiv.org/abs…) ☆180 · Updated last year
- A PyTorch implementation of 1D and 2D sinusoidal positional encoding/embedding ☆251 · Updated 3 years ago
- [ICLR 2021 top 3%] "Is Attention Better Than Matrix Decomposition?" ☆325 · Updated 2 years ago
- Learning-rate warmup in PyTorch ☆393 · Updated this week
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms ☆251 · Updated 3 years ago
- Sinkhorn Transformer: practical implementation of Sparse Sinkhorn Attention ☆253 · Updated 3 years ago
- A Pytorch implementation of Global Self-Attention Network, a fully-attentional backbone for vision tasks ☆92 · Updated 3 years ago
- Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723) ☆480 · Updated 3 years ago
- A better PyTorch implementation of image local attention that reduces GPU memory usage by an order of magnitude ☆136 · Updated 2 years ago
- An all-MLP solution for vision, from Google AI ☆1,001 · Updated last month
- Implementation of Uniformer, a simple attention and 3D-convolutional net that achieved SOTA on a number of video classification tasks, de… ☆97 · Updated 2 years ago
- [NeurIPS 2021] [T-PAMI] Global Filter Networks for Image Classification ☆445 · Updated last year