umitkacar / awesome-tinyml
TinyML & Edge AI: On-device inference, model quantization, embedded ML, ultra-low-power AI for microcontrollers and IoT devices.
☆34 Updated 3 months ago
Alternatives and similar repositories for awesome-tinyml
Users who are interested in awesome-tinyml are comparing it to the repositories listed below.
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,… ☆17 Updated this week
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices. ☆13 Updated last year
- AI Edge Quantizer: flexible post training quantization for LiteRT models. ☆99 Updated this week
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models ☆67 Updated last year
- ONNX and TensorRT implementation of Whisper ☆66 Updated 2 years ago
- How to export PyTorch models with unsupported layers to ONNX and then to Intel OpenVINO ☆28 Updated 11 months ago
- Converts CLIP models to ONNX ☆11 Updated 3 years ago
- A simple Python tool to measure the performance of ONNX models. ☆27 Updated last year
- ONNX implementation of Whisper. PyTorch free. ☆103 Updated last year
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more! ☆54 Updated 3 months ago
- A curated list of research in System for Edge Intelligence and Computing (Edge MLSys), including Frameworks, Tools, Repository, etc. Paper… ☆33 Updated 4 years ago
- Integrate an LLM copilot within your Keras model development workflow ☆28 Updated 2 years ago
- The Facial Landmark Detection ☆15 Updated 6 months ago
- Deep Compression for PyTorch Model Deployment on Microcontrollers ☆19 Updated 4 years ago
- Get an OpenCV video capture from a YouTube video URL ☆27 Updated last year
- ☆23 Updated 3 years ago
- Example of onnx quantization ☆11 Updated 3 years ago
- Simple tool to change the INPUT and OUTPUT shape of ONNX. ☆15 Updated 10 months ago
- ☆13 Updated 3 weeks ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX. ☆52 Updated last year
- [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference ☆26 Updated 2 weeks ago
- Model compression for ONNX ☆98 Updated last year
- Implementation of a Light Recurrent Unit in PyTorch ☆49 Updated last year
- Python wrapper class for OpenVINO Model Server. Users can submit inference requests to OVMS with just a few lines of code. ☆10 Updated 4 years ago
- Describing How to Enable OpenVINO Execution Provider for ONNX Runtime ☆20 Updated 5 years ago
- Provide Docker build sequences of Open3D for various environments. ☆14 Updated 4 years ago
- Model zoo for Gen AI models for Hailo products ☆41 Updated 2 weeks ago
- Machine vision apps ☆39 Updated last week
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23) ☆14 Updated 2 years ago
- A straightforward explanation of how DeepSeek R1 works ☆17 Updated last year