fawazsammani / awesome-mlp-mixer
Transformers w/o Attention, based fully on MLPs
☆93 Updated last year
Alternatives and similar repositories for awesome-mlp-mixer
Users who are interested in awesome-mlp-mixer compare it to the repositories listed below
- Recent Advances in MLP-based Models (MLP is all you need!) ☆115 Updated 2 years ago
- A compilation of network architectures for vision and others without usage of self-attention mechanism ☆80 Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity" ☆72 Updated 2 years ago
- Implementation of ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks, ICML 2021. ☆145 Updated 3 years ago
- PyTorch reimplementation of the Mixer (MLP-Mixer: An all-MLP Architecture for Vision) ☆36 Updated 4 years ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen… ☆79 Updated last year
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding ☆48 Updated 8 months ago
- A collection of differentiable SVD methods and ICCV21 "Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance P… ☆77 Updated last year
- This repo is for our paper: Normalization Techniques in Training DNNs: Methodology, Analysis and Application ☆84 Updated 3 years ago
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision ☆217 Updated 4 years ago
- Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022) ☆156 Updated 2 years ago
- PyTorch implementations of KMeans, Soft-KMeans and Constrained-KMeans which can be run on GPU and work on (mini-)batches of data. ☆65 Updated 2 years ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides. ☆124 Updated 3 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne… ☆115 Updated 2 years ago
- ☆47 Updated 2 years ago
- A simple minimal implementation of Reversible Vision Transformers ☆125 Updated last year
- Open-source code for the research work published on arXiv: https://arxiv.org/abs/2106.02689 ☆51 Updated 3 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de… ☆100 Updated 3 years ago
- This is the official GitHub for paper: On the Versatile Uses of Partial Distance Correlation in Deep Learning, in ECCV 2022 ☆173 Updated last year
- Whitening for Self-Supervised Representation Learning | Official repository ☆129 Updated 2 years ago
- The official GitHub repo for "Test-Time Training with Masked Autoencoders" ☆83 Updated last year
- Differentiable Top-k Classification Learning ☆82 Updated 2 years ago
- ResMLP: Feedforward networks for image classification with data-efficient training ☆42 Updated 4 years ago
- (ICLR 2023) Official PyTorch implementation of "What Do Self-Supervised Vision Transformers Learn?" ☆110 Updated last year
- This is an official PyTorch/GPU implementation of SupMAE. ☆78 Updated 2 years ago
- Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity) ☆76 Updated 4 years ago
- ☆25 Updated 3 years ago
- (ICML 2022) Official PyTorch implementation of “Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Rob… ☆78 Updated 2 years ago
- A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework". ☆83 Updated last year
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer ☆69 Updated last year