erendn / pytorch-compression-for-mcu
Deep Compression for PyTorch Model Deployment on Microcontrollers
☆19Updated 4 years ago
Alternatives and similar repositories for pytorch-compression-for-mcu:
Users that are interested in pytorch-compression-for-mcu are comparing it to the libraries listed below
- INT-Q Extension of the CMSIS-NN library for ARM Cortex-M target☆18Updated 5 years ago
- ☆31Updated 5 years ago
- ☆29Updated 3 years ago
- Bring your AI to the Edge - Starting from building the ML model to the selection of the target platform to the optimization and implement…☆52Updated 2 years ago
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa…☆24Updated 3 years ago
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices☆41Updated 5 years ago
- Converting a deep neural network to integer-only inference in native C via uniform quantization and the fixed-point representation.☆23Updated 3 years ago
- A Python package with command-line utilities and scripts to aid the development of machine learning models for Silicon Lab's embedded pl…☆56Updated last month
- Lightweight C implementation of CNNs for Embedded Systems☆60Updated 2 years ago
- This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆49Updated 11 months ago
- Awesome Quantization Paper lists with Codes☆11Updated 4 years ago
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)☆26Updated 4 years ago
- Code for IEEE Internet Computing Journal paper 'OTA-TinyML: Over the Air Deployment of TinyML Models and Execution on IoT Devices'☆22Updated 2 years ago
- A highly modular PyTorch framework with a focus on Neural Architecture Search (NAS).☆23Updated 3 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 3 years ago
- μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.☆81Updated 4 years ago
- Quantization and Synthesis (Device Specific Code Generation) for ADI's MAX78000 and MAX78002 Edge AI Devices☆59Updated last week
- Code for WF-IoT paper 'TinyML Benchmark: Executing Fully Connected Neural Networks on Commodity Microcontrollers'☆35Updated 2 years ago
- Mobilenet v1 trained on Imagenet for STM32 using extended CMSIS-NN with INT-Q quantization support☆86Updated 5 years ago
- Model Training for ADI's MAX78000 and MAX78002 Edge AI Devices☆100Updated last week
- Fast NPU-aware Neural Architecture Search☆22Updated 3 years ago
- ☆222Updated 2 years ago
- A library to train and deploy quantised Deep Neural Networks☆23Updated 4 months ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆11Updated 2 years ago
- Efficient neural network deployment for uC using pytorch model☆32Updated 4 years ago
- A tool to deploy Deep Neural Networks on PULP-based SoC's☆80Updated 2 months ago
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆32Updated 2 years ago
- NEural Minimizer for pytOrch☆42Updated 9 months ago
- ☆19Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Tiny Inference v0.7 benchmark.☆17Updated last year