mlcommons / mobile_app_openLinks
Mobile App Open
☆64Updated this week
Alternatives and similar repositories for mobile_app_open
Users that are interested in mobile_app_open are comparing it to the libraries listed below
Sorting:
- Model compression for ONNX☆99Updated last year
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…☆84Updated this week
- C++ implementations for various tokenizers (sentencepiece, tiktoken etc).☆43Updated this week
- Count number of parameters / MACs / FLOPS for ONNX models.☆95Updated last year
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆81Updated 3 weeks ago
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆54Updated 3 weeks ago
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆182Updated 3 months ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Updated 2 years ago
- Common utilities for ONNX converters☆288Updated 3 months ago
- ONNX Command-Line Toolbox☆35Updated last year
- Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…☆35Updated 3 years ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆68Updated last year
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated 2 years ago
- Customized matrix multiplication kernels☆57Updated 3 years ago
- MegEngine到其他框架的转换器☆70Updated 2 years ago
- ☆159Updated 2 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆300Updated last year
- ☆125Updated last year
- A Toolkit to Help Optimize Large Onnx Model☆162Updated last month
- ☆68Updated 2 years ago
- Export utility for unconstrained channel pruned models☆72Updated 2 years ago
- A Toolkit to Help Optimize Onnx Model☆267Updated last week
- ☆34Updated 5 months ago
- A block oriented training approach for inference time optimization.☆33Updated last year
- edge/mobile transformer based Vision DNN inference benchmark☆16Updated 3 months ago
- ☆128Updated last week
- CUDA Templates for Linear Algebra Subroutines☆101Updated last year
- ☆166Updated 2 years ago
- QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime …☆92Updated this week
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆412Updated this week