fawazsammani / awesome-mlp-mixerLinks
Transformers w/o Attention, based fully on MLPs
☆95Updated last year
Alternatives and similar repositories for awesome-mlp-mixer
Users that are interested in awesome-mlp-mixer are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆74Updated 3 years ago
- Recent Advances in MLP-based Models (MLP is all you need!)☆117Updated 2 years ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆81Updated last year
- A simple minimal implementation of Reversible Vision Transformers☆126Updated last year
- ResMLP: Feedforward networks for image classification with data-efficient training☆45Updated 4 years ago
- A collection of differentiable SVD methods and ICCV21 "Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance P…☆79Updated 2 years ago
- A compilation of network architectures for vision and others without usage of self-attention mechanism☆81Updated 2 years ago
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.☆61Updated last year
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)☆26Updated last year
- Official code for ICCV 2023 paper "Convolutional Networks with Oriented 1D Kernels"☆47Updated last year
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆77Updated last year
- ☆54Updated 2 years ago
- Recent Advances on Efficient Vision Transformers☆55Updated 2 years ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆66Updated 2 years ago
- ☆69Updated last year
- Differentiable Top-k Classification Learning☆89Updated 2 years ago
- This repo is for our paper: Normalization Techniques in Training DNNs: Methodology, Analysis and Application☆85Updated 4 years ago
- ☆25Updated 4 years ago
- DeltaCNN End-to-End CNN Inference of Sparse Frame Differences in Videos☆59Updated 2 years ago
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding☆55Updated last year
- Official PyTorch implementation of A Quaternion-Valued Variational Autoencoder (QVAE).☆31Updated 3 years ago
- ☆47Updated 2 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆116Updated 3 years ago
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆30Updated last year
- A simple cross attention that updates both the source and target in one step☆189Updated 4 months ago
- An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers☆56Updated 2 years ago
- Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022)☆165Updated 3 years ago
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision☆217Updated 4 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆183Updated 6 months ago
- ☆187Updated last year