On-Device Training Under 256KB Memory [NeurIPS'22]
☆516 · Mar 29, 2024 · Updated last year
Alternatives and similar repositories for tiny-training
Users who are interested in tiny-training compare it to the libraries listed below.
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L… ☆929 · Nov 27, 2024 · Updated last year
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L… ☆667 · Mar 29, 2024 · Updated last year
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23) ☆14 · Nov 1, 2023 · Updated 2 years ago
- Experimental deep learning framework written in Rust ☆15 · Nov 2, 2022 · Updated 3 years ago
- ML model training for edge devices ☆168 · Sep 29, 2023 · Updated 2 years ago
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment ☆1,943 · Dec 14, 2023 · Updated 2 years ago
- [ICCAD'22 TinyML Contest] Efficient Heart Stroke Detection on Low-cost Microcontrollers ☆15 · Jan 12, 2023 · Updated 3 years ago
- ☆13 · Jul 14, 2025 · Updated 8 months ago
- Materials of public talks given by SJTU X-LANCE members ☆14 · Dec 3, 2022 · Updated 3 years ago
- Source code of the paper "Robust Quantization: One Model to Rule Them All" ☆41 · Mar 24, 2023 · Updated 2 years ago
- TinyChatEngine: On-Device LLM Inference Library ☆945 · Jul 4, 2024 · Updated last year
- ☆1,099 · Nov 29, 2023 · Updated 2 years ago
- Code for ICML 2022 paper "SPDY: Accurate Pruning with Speedup Guarantees" ☆20 · May 3, 2023 · Updated 2 years ago
- ☆30 · Feb 7, 2020 · Updated 6 years ago
- ☆14 · Aug 6, 2023 · Updated 2 years ago
- TinyMaix is a tiny inference library for microcontrollers (TinyML). ☆1,040 · Feb 5, 2025 · Updated last year
- ☆77 · Nov 5, 2024 · Updated last year
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory ☆75 · Oct 31, 2023 · Updated 2 years ago
- ☆176 · Aug 9, 2023 · Updated 2 years ago
- This is a list of interesting papers and projects about TinyML. ☆988 · Dec 8, 2025 · Updated 3 months ago
- ACL 2023 ☆39 · Jun 6, 2023 · Updated 2 years ago
- ☆157 · Jun 22, 2023 · Updated 2 years ago
- The official implementation of TinyTrain [ICML '24] ☆24 · Jul 19, 2024 · Updated last year
- Deep Compression for PyTorch Model Deployment on Microcontrollers ☆19 · Mar 26, 2021 · Updated 4 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices. ☆362 · Jul 30, 2024 · Updated last year
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration ☆3,463 · Jul 17, 2025 · Updated 8 months ago
- [FPGA-2022] N3H-Core: Neuron-designed Neural Network Accelerator via FPGA-based Heterogeneous Computing Cores ☆11 · Dec 16, 2021 · Updated 4 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo ☆17 · Mar 13, 2023 · Updated 3 years ago
- Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digit… ☆2,803 · Updated this week
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio… ☆49 · Sep 27, 2024 · Updated last year
- Offsite-Tuning: Transfer Learning without Full Model ☆388 · Nov 27, 2023 · Updated 2 years ago
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L… ☆49 · Oct 5, 2022 · Updated 3 years ago
- [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks ☆15 · May 18, 2022 · Updated 3 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models ☆1,621 · Jul 12, 2024 · Updated last year
- ☆35 · Mar 1, 2019 · Updated 7 years ago
- FPGA acceleration of arbitrary precision floating point computations. ☆40 · May 17, 2022 · Updated 3 years ago
- [NeurIPS 2023 Spotlight] This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low… ☆52 · May 13, 2024 · Updated last year
- llama INT4 cuda inference with AWQ ☆54 · Jan 20, 2025 · Updated last year
- Python library to work with the Visual Wake Words Dataset. ☆39 · Oct 1, 2020 · Updated 5 years ago