[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory
☆924Nov 27, 2024Updated last year
Alternatives and similar repositories for tinyengine
Users that are interested in tinyengine are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆654Mar 29, 2024Updated last year
- On-Device Training Under 256KB Memory [NeurIPS'22]☆516Mar 29, 2024Updated last year
- ☆1,081Nov 29, 2023Updated 2 years ago
- TinyChatEngine: On-Device LLM Inference Library☆942Jul 4, 2024Updated last year
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆486Oct 23, 2024Updated last year
- MLPerf® Tiny is an ML benchmark suite for extremely low-power systems such as microcontrollers☆443Jan 8, 2026Updated last month
- TinyMaix is a tiny inference library for microcontrollers (TinyML).☆1,035Feb 5, 2025Updated last year
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment☆1,940Dec 14, 2023Updated 2 years ago
- Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digit…☆2,766Feb 12, 2026Updated 2 weeks ago
- ☆248Mar 31, 2023Updated 2 years ago
- ☆177Aug 9, 2023Updated 2 years ago
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices☆48Mar 19, 2020Updated 5 years ago
- This is a list of interesting papers and projects about TinyML.☆974Dec 8, 2025Updated 2 months ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,563Updated this week
- CMSIS-NN Library☆370Jan 22, 2026Updated last month
- μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.☆82Jan 26, 2021Updated 5 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆1,612Jul 12, 2024Updated last year
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆866Dec 24, 2025Updated 2 months ago
- muRISCV-NN is a collection of efficient deep learning kernels for embedded platforms and microcontrollers.☆92Oct 6, 2025Updated 4 months ago
- ML model training for edge devices☆168Sep 29, 2023Updated 2 years ago
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆75Oct 31, 2023Updated 2 years ago
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration☆3,441Jul 17, 2025Updated 7 months ago
- Open Machine Learning Compiler Framework☆13,142Updated this week
- A higher-level Neural Network library for microcontrollers.☆1,134Apr 8, 2024Updated last year
- ☆24Mar 19, 2022Updated 3 years ago
- Arm NN ML Software.☆1,297Jan 23, 2026Updated last month
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,006Sep 19, 2024Updated last year
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,706Jan 12, 2026Updated last month
- Model Quantization Benchmark☆858Apr 20, 2025Updated 10 months ago
- A model compression and acceleration toolbox based on pytorch.☆333Jan 12, 2024Updated 2 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆453May 15, 2023Updated 2 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆363Jul 30, 2024Updated last year
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …☆2,585Feb 20, 2026Updated last week
- Tool for the deployment and analysis of TinyML applications on TFLM and MicroTVM backends☆33Updated this week
- ☆30Feb 7, 2020Updated 6 years ago
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.☆3,258Sep 7, 2025Updated 5 months ago
- Simplify your onnx model☆4,297Updated this week
- TinyML AI inference library☆1,903May 10, 2025Updated 9 months ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago