omihub777 / MLP-Mixer-CIFAR
PyTorch implementation of Mixer-nano (#parameters is 0.67M, originally Mixer-S/16 has 18M) with 90.83 % acc. on CIFAR-10. Training from scratch.
☆28Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for MLP-Mixer-CIFAR
- Lightweight torch implementation of rigl, a sparse-to-sparse optimizer.☆55Updated 3 years ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆17Updated 5 years ago
- ☆186Updated 3 years ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆46Updated 3 years ago
- ☆11Updated 3 years ago
- ☆59Updated 3 years ago
- ☆57Updated last year
- Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH☆101Updated 4 years ago
- Train ImageNet *fast* in 500 lines of code with FFCV☆136Updated 6 months ago
- PyTorch implementation for Vision Transformer[Dosovitskiy, A.(ICLR'21)] modified to obtain over 90% accuracy FROM SCRATCH on CIFAR-10 wit…☆170Updated 9 months ago
- Soft Threshold Weight Reparameterization for Learnable Sparsity☆87Updated last year
- Code for Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot☆43Updated 4 years ago
- ☆67Updated 5 years ago
- Lookahead: A Far-sighted Alternative of Magnitude-based Pruning (ICLR 2020)☆33Updated 4 years ago
- Python implementation of the methods in Meulemans et al. 2020 - A Theoretical Framework For Target Propagation☆28Updated 3 weeks ago
- Study on the applicability of Direct Feedback Alignment to neural view synthesis, recommender systems, geometric learning, and natural la…☆84Updated 2 years ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆10Updated last year
- ☆14Updated 3 years ago
- Implementation of Continuous Sparsification, a method for pruning and ticket search in deep networks☆32Updated 2 years ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆207Updated last month
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 2 years ago
- Mode Connectivity and Fast Geometric Ensembles in PyTorch☆265Updated 2 years ago
- ☆14Updated last year
- Implementation of Effective Sparsification of Neural Networks with Global Sparsity Constraint☆28Updated 2 years ago
- ☆33Updated 9 months ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆128Updated 5 years ago
- ☆194Updated last year
- Contains code for the NeurIPS 2020 paper by Pan et al., "Continual Deep Learning by FunctionalRegularisation of Memorable Past"☆44Updated 4 years ago
- ☆82Updated 4 years ago
- ImageNet Testbed, associated with the paper "Measuring Robustness to Natural Distribution Shifts in Image Classification."☆116Updated last year