fawazsammani / awesome-mlp-mixerLinks
Transformers w/o Attention, based fully on MLPs
☆93Updated last year
Alternatives and similar repositories for awesome-mlp-mixer
Users that are interested in awesome-mlp-mixer are comparing it to the libraries listed below
Sorting:
- Recent Advances in MLP-based Models (MLP is all you need!)☆115Updated 2 years ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆80Updated last year
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆72Updated 2 years ago
- open source the research work for published on arxiv. https://arxiv.org/abs/2106.02689☆52Updated 3 years ago
- A collection of differentiable SVD methods and ICCV21 "Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance P…☆78Updated last year
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding☆51Updated 9 months ago
- A compilation of network architectures for vision and others without usage of self-attention mechanism☆80Updated 2 years ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆58Updated last year
- ☆25Updated 3 years ago
- A simple minimal implementation of Reversible Vision Transformers☆125Updated last year
- ResMLP: Feedforward networks for image classification with data-efficient training☆43Updated 4 years ago
- This repo is for our paper: Normalization Techniques in Training DNNs: Methodology, Analysis and Application☆85Updated 4 years ago
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆30Updated last year
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆116Updated 2 years ago
- ☆66Updated 8 months ago
- ☆47Updated 2 years ago
- Recent Advances on Efficient Vision Transformers☆51Updated 2 years ago
- PyTorch implementations of KMeans, Soft-KMeans and Constrained-KMeans which can be run on GPU and work on (mini-)batches of data.☆67Updated 2 years ago
- Official code for ICCV 2023 paper "Convolutional Networks with Oriented 1D Kernels"☆47Updated last year
- ☆24Updated 3 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆184Updated 2 months ago
- Robustness via Cross-Domain Ensembles, ICCV 2021 [Oral]☆39Updated 3 years ago
- Differentiable Top-k Classification Learning☆83Updated 2 years ago
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆72Updated last year
- A better PyTorch implementation of image local attention which reduces the GPU memory by an order of magnitude.☆141Updated 3 years ago
- ☆36Updated 2 years ago
- A simple cross attention that updates both the source and target in one step☆172Updated last year
- State Space Models☆68Updated last year
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)☆24Updated last year