On-Device Training Under 256KB Memory [NeurIPS'22]
☆520Mar 29, 2024Updated 2 years ago
Alternatives and similar repositories for tiny-training
Users that are interested in tiny-training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆943Nov 27, 2024Updated last year
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆693Mar 29, 2024Updated 2 years ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- Experimental deep learning framework written in Rust☆15Nov 2, 2022Updated 3 years ago
- ML model training for edge devices☆169Sep 29, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment☆1,948Dec 14, 2023Updated 2 years ago
- [ICCAD'22 TinyML Contest] Efficient Heart Stroke Detection on Low-cost Microcontrollers☆16Jan 12, 2023Updated 3 years ago
- ☆14Jul 14, 2025Updated 10 months ago
- Materials of public talks given By SJTU X-LANCE members☆14Dec 3, 2022Updated 3 years ago
- source code of the paper: Robust Quantization: One Model to Rule Them All☆41Mar 24, 2023Updated 3 years ago
- TinyChatEngine: On-Device LLM Inference Library☆952Jul 4, 2024Updated last year
- ☆1,161Nov 29, 2023Updated 2 years ago
- Code for ICML 2022 paper "SPDY: Accurate Pruning with Speedup Guarantees"☆20May 3, 2023Updated 3 years ago
- ☆30Feb 7, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆14Aug 6, 2023Updated 2 years ago
- TinyMaix is a tiny inference library for microcontrollers (TinyML).☆1,056Feb 5, 2025Updated last year
- ☆78Nov 5, 2024Updated last year
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆75Oct 31, 2023Updated 2 years ago
- ☆179Aug 9, 2023Updated 2 years ago
- This is a list of interesting papers and projects about TinyML.☆1,005Dec 8, 2025Updated 5 months ago
- ACL 2023☆39Jun 6, 2023Updated 2 years ago
- ☆157Jun 22, 2023Updated 2 years ago
- Deep Compression for PyTorch Model Deployment on Microcontrollers☆20Mar 26, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration☆3,536Jul 17, 2025Updated 10 months ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆365Jul 30, 2024Updated last year
- The official implementation of TinyTrain [ICML '24]☆27Jul 19, 2024Updated last year
- [FPGA-2022] N3H-Core: Neuron-designed Neural Network Accelerator via FPGA-based Heterogeneous Computing Cores☆11Dec 16, 2021Updated 4 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 3 years ago
- Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digit…☆2,905Updated this week
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆49Sep 27, 2024Updated last year
- Offsite-Tuning: Transfer Learning without Full Model☆388Nov 27, 2023Updated 2 years ago
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…☆49Oct 5, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks☆15May 18, 2022Updated 4 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆1,647Jul 12, 2024Updated last year
- ☆35Mar 1, 2019Updated 7 years ago
- FPGA acceleration of arbitrary precision floating point computations.☆41May 17, 2022Updated 4 years ago
- [NeurIPS 2023 Spotlight] This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low…☆52May 13, 2024Updated 2 years ago
- llama INT4 cuda inference with AWQ☆54Jan 20, 2025Updated last year
- Python library to work with the Visual Wake Words Dataset.☆40Oct 1, 2020Updated 5 years ago