PyTorch implementation of Mixer-nano (#parameters is 0.67M, originally Mixer-S/16 has 18M) with 90.83 % acc. on CIFAR-10. Training from scratch.
☆37Nov 6, 2021Updated 4 years ago
Alternatives and similar repositories for MLP-Mixer-CIFAR
Users that are interested in MLP-Mixer-CIFAR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch reimplementation of the Mixer (MLP-Mixer: An all-MLP Architecture for Vision)☆36May 11, 2021Updated 4 years ago
- PyTorch implementation for Vision Transformer[Dosovitskiy, A.(ICLR'21)] modified to obtain over 90% accuracy FROM SCRATCH on CIFAR-10 wit…☆203Feb 5, 2024Updated 2 years ago
- some mixture of experts architecture implementations☆27Mar 22, 2024Updated 2 years ago
- ☆18Oct 15, 2021Updated 4 years ago
- ☆21Apr 10, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- EP☆19Mar 9, 2021Updated 5 years ago
- The code for tuning Spiking Neural Network based on Biologically-plausible Reward Propagation☆12Nov 23, 2020Updated 5 years ago
- Reproducing Spiking Transformers using Briancog☆14Apr 10, 2025Updated last year
- Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".☆11Feb 5, 2024Updated 2 years ago
- 3D detection, tracking, and localization of spatially-compact static objects (such as signs and traffic lights) from a single camera of a…☆11Jan 21, 2021Updated 5 years ago
- ☆11Jul 24, 2018Updated 7 years ago
- The code associated with Comparing SNNs and RNNs on neuromorphic vision datasets: Similarities and differences.☆53Aug 20, 2021Updated 4 years ago
- ☆35Mar 13, 2021Updated 5 years ago
- A custom PyTorch layer that is capable of implementing extremely wide and sparse linear layers efficiently☆51Dec 14, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Text Classification model deployment using FastAPI, Streamlit and Docker Compose☆14Feb 12, 2021Updated 5 years ago
- ☆10Aug 3, 2023Updated 2 years ago
- ☆19Mar 18, 2021Updated 5 years ago
- PyTorch implementation of HashedNets☆38Apr 21, 2023Updated 2 years ago
- This is the proof-of-concept CPU implementation of ASPEN used for the NeurIPS'23 paper ASPEN: Breaking Operator Barriers for Efficient Pa…☆13Apr 4, 2024Updated 2 years ago
- ☆10Jun 28, 2019Updated 6 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Deep learning and standard machine learning methods are developed and compared in classfying audio samples from microphones deployed abo…☆11Jan 17, 2020Updated 6 years ago
- ☆10Aug 26, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Jupyter notebooks from our weekly (or so) hackathons☆11Dec 3, 2024Updated last year
- ☆14Aug 25, 2021Updated 4 years ago
- this is the code implementation of "SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Rec…☆17Oct 9, 2024Updated last year
- Implementation of Metaformer, but in an autoregressive manner☆26Jun 21, 2022Updated 3 years ago
- PyTorch code for our paper "Progressive Binarization with Semi-Structured Pruning for LLMs"☆13Mar 11, 2026Updated last month
- Code for 'Adaptive Deep PnP Algorithm for Video Snapshot Compressive Imaging'☆18Jan 15, 2024Updated 2 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆33Sep 12, 2025Updated 7 months ago
- ☆16Apr 26, 2023Updated 2 years ago
- Code for Semi-crowdsourced Clustering with Deep Generative Models☆12Dec 9, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code accompanying VarGrad: A Low-Variance Gradient Estimator for Variational Inference☆12Oct 12, 2020Updated 5 years ago
- This is the repository for the ICLR2023 accepted paper -- Medical Image Understanding With Pretrained VLM☆31Jun 9, 2023Updated 2 years ago
- Decentralized Deep Reinforcement Learning based Real-World Applicable Traffic Signal Optimization☆11Jul 4, 2021Updated 4 years ago
- Text paraphrasing tool☆12Aug 29, 2023Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- A Simple and Efficient Reconstruction Backbone for Snapshot Compressive Imaging☆25Apr 18, 2023Updated 3 years ago
- PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)☆18Jun 22, 2022Updated 3 years ago