fawazsammani / awesome-mlp-mixer
Transformers without attention, built entirely on MLPs.
☆93, updated 11 months ago
Alternatives and similar repositories for awesome-mlp-mixer:
Users interested in awesome-mlp-mixer compare it to the libraries listed below.
- [NeurIPS 2022 Spotlight] Official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity" (☆70, updated 2 years ago)
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne… (☆115, updated 2 years ago)
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen… (☆80, updated last year)
- Recent Advances in MLP-based Models (MLP is all you need!) (☆114, updated 2 years ago)
- A simple, minimal implementation of Reversible Vision Transformers (☆122, updated last year)
- TF/Keras code for DiffStride, a pooling layer with learnable strides (☆125, updated 3 years ago)
- A compilation of network architectures for vision and other tasks that avoid the self-attention mechanism (☆77, updated 2 years ago)
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding (☆45, updated 5 months ago)
- Implementation of "ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks", ICML 2021 (☆141, updated 3 years ago)
- ☆50, updated last year
- Open-source release of the research work published on arXiv: https://arxiv.org/abs/2106.02689 (☆51, updated 3 years ago)
- PyTorch implementations of KMeans, Soft-KMeans, and Constrained-KMeans that run on GPU and work on (mini-)batches of data (☆63, updated 2 years ago)
- (NeurIPS 2023) PyTorch implementation of "Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation" (☆18, updated 5 months ago)
- ResMLP: Feedforward networks for image classification with data-efficient training (☆42, updated 3 years ago)
- ☆25, updated 3 years ago
- Implementation of fused cosine-similarity attention in the same style as Flash Attention (☆211, updated 2 years ago)
- Official PyTorch/GPU implementation of SupMAE (☆77, updated 2 years ago)
- Official code release of the paper "RGB no more: Minimally Decoded JPEG Vision Transformers" (☆56, updated last year)
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference (☆29, updated last year)
- ☆65, updated 4 months ago
- Differentiable Top-k Classification Learning (☆80, updated 2 years ago)
- ☆191, updated 2 years ago
- A practical PyTorch implementation of GradNorm, gradient normalization for adaptive loss balancing (☆86, updated last year)
- A simple cross-attention that updates both the source and target in one step (☆164, updated 10 months ago)
- More dimensions = more fun (☆21, updated 7 months ago)
- Unofficial implementation of "MLP-Mixer: An all-MLP Architecture for Vision" (☆217, updated 3 years ago)
- [CVPR 2022, Oral] Official JAX implementation of Learned Queries for Efficient Local Attention (☆116, updated 2 years ago)
- PyTorch implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model" (☆48, updated 2 years ago)
- Visualizing representations with a diffusion-based conditional generative model (☆90, updated last year)
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions (☆61, updated 10 months ago)
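The repositories above center on the MLP-Mixer idea: replacing self-attention with two plain MLPs, one mixing information across patches (token mixing) and one across feature channels (channel mixing), each wrapped in a layer norm and a skip connection. Below is a minimal NumPy sketch of a single Mixer block to illustrate the data flow; it is not taken from any of the listed repositories, and all names, shapes, and the random weights are illustrative assumptions.

```python
import numpy as np

def layer_norm(x, eps=1e-6):
    # Normalize each row over its last axis (per-token feature vector).
    mu = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def gelu(x):
    # tanh approximation of the GELU activation.
    return 0.5 * x * (1 + np.tanh(np.sqrt(2 / np.pi) * (x + 0.044715 * x**3)))

def mlp(x, w1, w2):
    # Two-layer feed-forward network: expand, activate, project back.
    return gelu(x @ w1) @ w2

def mixer_block(x, tok_w1, tok_w2, ch_w1, ch_w2):
    # x: (patches, channels)
    # Token mixing: transpose so the MLP acts across the patch axis.
    x = x + mlp(layer_norm(x).T, tok_w1, tok_w2).T
    # Channel mixing: MLP acts across the channel axis.
    x = x + mlp(layer_norm(x), ch_w1, ch_w2)
    return x

# Toy example with hypothetical sizes: 16 patches, 32 channels, hidden width 64.
rng = np.random.default_rng(0)
patches, channels, hidden = 16, 32, 64
x = rng.normal(size=(patches, channels))
out = mixer_block(
    x,
    rng.normal(size=(patches, hidden)) * 0.02,   # token-mixing weights
    rng.normal(size=(hidden, patches)) * 0.02,
    rng.normal(size=(channels, hidden)) * 0.02,  # channel-mixing weights
    rng.normal(size=(hidden, channels)) * 0.02,
)
print(out.shape)  # (16, 32): shape is preserved, so blocks can be stacked
```

Because the block preserves the (patches, channels) shape, a full model simply stacks several such blocks and finishes with global average pooling and a linear classifier, exactly the structure the MLP-Mixer-style repositories above implement.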