PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)
☆31May 13, 2021Updated 5 years ago
Alternatives and similar repositories for mlp-mixer-pytorch
Users that are interested in mlp-mixer-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Aug 28, 2019Updated 6 years ago
- Manifold-Mixup implementation for fastai V1☆19Oct 1, 2020Updated 5 years ago
- How to use tensorboard in fastai☆21Jul 10, 2019Updated 6 years ago
- Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision☆216May 5, 2021Updated 5 years ago
- PyTorch Code for the Paper: "Exploiting Uncertainty of Loss Landscape for Stochastic Optimization [Bhaskara et al. (2019)]☆16Apr 30, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16May 14, 2025Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025]☆30Feb 20, 2026Updated 4 months ago
- ☆11Feb 18, 2022Updated 4 years ago
- PyTorch code for our paper "Progressive Binarization with Semi-Structured Pruning for LLMs"☆13Mar 11, 2026Updated 3 months ago
- Deep Variational Information Bottleneck (DVIB) in PyTorch.☆10Apr 25, 2020Updated 6 years ago
- Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)☆111Feb 10, 2026Updated 4 months ago
- ☆13Mar 28, 2022Updated 4 years ago
- An recognition oriented deep learning framework for biometric sample quality assessment☆12Aug 24, 2023Updated 2 years ago
- ☆22Jan 23, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- AWS Lambda + rio-tiler to serve tiles from any web hosted files☆11Jul 23, 2020Updated 5 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- ☆12Apr 6, 2026Updated 2 months ago
- PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)☆18Jun 22, 2022Updated 4 years ago
- TensorRT-in-Action 是一个 GitHub 代码库,提供了使用 TensorRT 的代码示例,并有对应 Jupyter Notebook。☆15Jun 1, 2023Updated 3 years ago
- Introduction to Dask for PyTorch Workflows☆13Mar 3, 2021Updated 5 years ago
- Jax implementation of the AdaHessian optimizer☆19Mar 11, 2021Updated 5 years ago
- ☆15Apr 26, 2022Updated 4 years ago
- dancetrack 比赛第二名☆13Jan 29, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Starlight: A Kernel Optimizer for GPU Processing☆16Jan 10, 2024Updated 2 years ago
- ActiveHARNet: Towards On-Device Deep Bayesian Active Learning for Human Activity Recognition☆16Nov 7, 2020Updated 5 years ago
- The code for "No Routing Needed Between Capsules". This repository contains the code used for the experiments detailed in a forthcoming …☆52Jun 25, 2021Updated 5 years ago
- 16 bit serial multiplier in SystemVerilog☆13Oct 13, 2018Updated 7 years ago
- ☆14Mar 21, 2020Updated 6 years ago
- Code for applying ML algorithms trained on ground spectra to autoclassify ice surface type in UAV or Sentinel-2 derived multispectral ima…☆11Dec 5, 2019Updated 6 years ago
- AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao.☆13Nov 8, 2021Updated 4 years ago
- TLLM_QMM strips the implementation of quantized kernels of Nvidia's TensorRT-LLM, removing NVInfer dependency and exposes ease of use Pyt…☆16Jul 5, 2024Updated last year
- stitch together image tiles for large-format prints☆14Jan 26, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A collection of optimizers, some arcane others well known, for Flax.☆29Aug 6, 2021Updated 4 years ago
- [IROS 2016] Implements an adaptive gating sensor fusion approach for object detection based on a mixture of convolutional neural network…☆10Mar 16, 2020Updated 6 years ago
- Code release for "Learning from Missing Relations: Contrastive Learning with Commonsense Knowledge Graphs for Commonsense Inference"☆10Jun 25, 2022Updated 4 years ago
- Quantize pytorch model, support post-training quantization and quantization aware training methods☆15Jun 15, 2023Updated 3 years ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- A PyTorch implementation of MixNet: Mixed Depthwise Convolutional Kernels☆11Aug 5, 2019Updated 6 years ago
- ☆15Jun 11, 2025Updated last year