umitkacar / awesome-tinymlLinks
TinyML & Edge AI: On-device inference, model quantization, embedded ML, ultra-low-power AI for microcontrollers and IoT devices.
☆20Updated last week
Alternatives and similar repositories for awesome-tinyml
Users that are interested in awesome-tinyml are comparing it to the libraries listed below
Sorting:
- Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"☆98Updated 2 months ago
- A curated list of research in System for Edge Intelligence and Computing(Edge MLSys), including Frameworks, Tools, Repository, etc. Paper…☆30Updated 3 years ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆68Updated last year
- ☆22Updated 3 years ago
- KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference☆23Updated 6 months ago
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆78Updated this week
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality☆18Updated 2 years ago
- Code for paper "Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI" (MobiCom'22)☆18Updated 2 years ago
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆12Updated last year
- Integrate an LLM copilot within your Keras model development workflow☆28Updated 2 years ago
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆54Updated last week
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency☆28Updated 4 years ago
- ☆10Updated 8 months ago
- Knock your images before you get stressed.☆10Updated 3 years ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆48Updated last year
- [IEEE TMC] Official Repository for the paper on Efficient Neural Feature Compression with Shallow Variational Bottleneck Injection for Mo…☆22Updated 5 months ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 3 years ago
- The Facial Landmark Detection☆15Updated 4 months ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 3 years ago
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆11Updated last year
- ☆46Updated last year
- [Applied Intelligence 2022] Python code for ACP☆12Updated 2 years ago
- Measuring RAG solutions throughput and latency☆18Updated last year
- A project using YoloV8 to detect License Plates☆12Updated 2 years ago
- Converts CLIP models to ONNX☆11Updated 2 years ago
- Implementation of transformers based architecture in PyTorch.☆54Updated 4 years ago
- Code for blog posts from OpenCV.AI☆15Updated 2 years ago
- Model compression for ONNX☆98Updated last year
- Personalized machine learning on the smartphone☆58Updated 2 years ago
- paper-read-notes☆14Updated last year