umitkacar / awesome-tinymlLinks
TinyML & Edge AI: On-device inference, model quantization, embedded ML, ultra-low-power AI for microcontrollers and IoT devices.
☆26Updated 2 months ago
Alternatives and similar repositories for awesome-tinyml
Users that are interested in awesome-tinyml are comparing it to the libraries listed below
Sorting:
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆88Updated this week
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆17Updated 3 months ago
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…☆32Updated 4 years ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆68Updated last year
- The Facial Landmark Detection☆15Updated 5 months ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆52Updated last year
- ONNX and TensorRT implementation of Whisper☆66Updated 2 years ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆62Updated last year
- Converts CLIP models to ONNX☆11Updated 3 years ago
- Simple CogVLM client script☆14Updated 2 years ago
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆11Updated last year
- Simple tool to combine(merge) onnx models. Simple Network Combine Tool for ONNX.☆18Updated 3 months ago
- 使用onnxruntime部署实时视频帧插值,包含C++和Python两个版本的程序☆28Updated last year
- Build TensorFlow Lite runtime with GitHub Actions☆27Updated 5 months ago
- EdgeSAM model for use with Autodistill.☆29Updated last year
- ONNX implementation of Whisper. PyTorch free.☆102Updated last year
- Integrate an LLM copilot within your Keras model development workflow☆28Updated 2 years ago
- mnn asr demo.☆23Updated 9 months ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆97Updated last year
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆19Updated 2 weeks ago
- Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"☆100Updated 4 months ago
- KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference☆25Updated 8 months ago
- How to export PyTorch models with unsupported layers to ONNX and then to Intel OpenVINO☆28Updated 10 months ago
- Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…☆36Updated 3 years ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆68Updated last year
- A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible.☆52Updated 3 years ago
- A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier …☆15Updated 3 months ago
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14Updated last year
- Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art (WACVW'24)☆17Updated 2 years ago
- This tool displays tflite signatures and rewrites the input/output OP name to the name of the signature. There is no need to install Tens…☆10Updated 2 years ago