mlcommons / mobile_app_openLinks
Mobile App Open
☆63Updated this week
Alternatives and similar repositories for mobile_app_open
Users that are interested in mobile_app_open are comparing it to the libraries listed below
Sorting:
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆83Updated this week
- Model compression for ONNX☆98Updated last year
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆76Updated this week
- Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…☆35Updated 3 years ago
- A code generator from ONNX to PyTorch code☆141Updated 3 years ago
- Count number of parameters / MACs / FLOPS for ONNX models.☆95Updated last year
- MegEngine到其他框架的转换器☆70Updated 2 years ago
- Customized matrix multiplication kernels☆57Updated 3 years ago
- Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/☆102Updated 9 months ago
- ☆338Updated last year
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆181Updated 2 months ago
- C++ implementations for various tokenizers (sentencepiece, tiktoken etc).☆40Updated this week
- Common utilities for ONNX converters☆284Updated 2 months ago
- MediaTek's TFLite delegate☆49Updated last year
- QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime …☆84Updated last week
- ☆166Updated last week
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆53Updated last week
- The official, proof-of-concept C++ implementation of PocketNN.☆35Updated last month
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆95Updated this week
- Build TVM docker image for production compilation deployments☆12Updated 4 years ago
- Scailable ONNX python tools☆97Updated last year
- Inference of quantization aware trained networks using TensorRT☆83Updated 2 years ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆68Updated last year
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆164Updated this week
- ☆207Updated 4 years ago
- ☆165Updated 2 years ago
- PyTorch Quantization Aware Training Example☆144Updated last year
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆425Updated this week
- Export utility for unconstrained channel pruned models☆72Updated 2 years ago
- A Toolkit to Help Optimize Large Onnx Model☆162Updated 3 weeks ago