fawazsammani / awesome-mlp-mixerLinks
Transformers w/o Attention, based fully on MLPs
☆97Updated last year
Alternatives and similar repositories for awesome-mlp-mixer
Users that are interested in awesome-mlp-mixer are comparing it to the libraries listed below
Sorting:
- Recent Advances in MLP-based Models (MLP is all you need!)☆117Updated 3 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆74Updated 3 years ago
- A collection of differentiable SVD methods and ICCV21 "Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance P…☆79Updated 2 years ago
- A simple minimal implementation of Reversible Vision Transformers☆126Updated last year
- A compilation of network architectures for vision and others without usage of self-attention mechanism☆81Updated 3 years ago
- Recent Advances on Efficient Vision Transformers☆55Updated 3 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆183Updated 8 months ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆82Updated 2 years ago
- Official code for ICCV 2023 paper "Convolutional Networks with Oriented 1D Kernels"☆47Updated last year
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding☆55Updated last year
- This repo is for our paper: Normalization Techniques in Training DNNs: Methodology, Analysis and Application☆85Updated 4 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆116Updated 3 years ago
- ResMLP: Feedforward networks for image classification with data-efficient training☆45Updated 4 years ago
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision☆217Updated 4 years ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)☆26Updated last year
- An implementation of the efficient attention module.☆327Updated 5 years ago
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆29Updated last year
- ☆40Updated 2 years ago
- A better PyTorch implementation of image local attention which reduces the GPU memory by an order of magnitude.☆142Updated 4 years ago
- Official PyTorch implementation of LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification☆47Updated 3 years ago
- ☆56Updated 2 years ago
- ☆25Updated 4 years ago
- open source the research work for published on arxiv. https://arxiv.org/abs/2106.02689☆53Updated 3 years ago
- ☆69Updated last year
- Differentiable Top-k Classification Learning☆91Updated 3 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆79Updated 3 years ago
- A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.☆131Updated last year
- Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022)☆165Updated 3 years ago
- DeltaCNN End-to-End CNN Inference of Sparse Frame Differences in Videos☆59Updated 2 years ago
- A simple cross attention that updates both the source and target in one step☆194Updated 5 months ago