fawazsammani / awesome-mlp-mixerLinks
Transformers w/o Attention, based fully on MLPs
☆94Updated last year
Alternatives and similar repositories for awesome-mlp-mixer
Users that are interested in awesome-mlp-mixer are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆74Updated 2 years ago
- Recent Advances in MLP-based Models (MLP is all you need!)☆116Updated 2 years ago
- A collection of differentiable SVD methods and ICCV21 "Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance P…☆78Updated last year
- A compilation of network architectures for vision and others without usage of self-attention mechanism☆80Updated 2 years ago
- A simple minimal implementation of Reversible Vision Transformers☆125Updated last year
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆80Updated last year
- Official code for ICCV 2023 paper "Convolutional Networks with Oriented 1D Kernels"☆47Updated last year
- Recent Advances on Efficient Vision Transformers☆53Updated 2 years ago
- ResMLP: Feedforward networks for image classification with data-efficient training☆44Updated 4 years ago
- ☆66Updated 10 months ago
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆30Updated last year
- ☆25Updated 3 years ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆61Updated 2 years ago
- open source the research work for published on arxiv. https://arxiv.org/abs/2106.02689☆52Updated 3 years ago
- Official implementation for "SimA: Simple Softmax-free Attention for Vision Transformers"☆43Updated last year
- A better PyTorch implementation of image local attention which reduces the GPU memory by an order of magnitude.☆141Updated 3 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆183Updated 3 months ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)☆24Updated last year
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆73Updated last year
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders☆68Updated last year
- Differentiable Top-k Classification Learning☆85Updated 2 years ago
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]☆89Updated 2 years ago
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision☆218Updated 4 years ago
- ☆47Updated 2 years ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆78Updated 2 years ago
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding☆56Updated 10 months ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Updated 3 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- ☆51Updated last year