mlcommons / mobile_app_openLinks

Mobile App Open

☆58

Alternatives and similar repositories for mobile_app_open

Users that are interested in mobile_app_open are comparing it to the libraries listed below

Sorting:

quic / efficient-transformers
This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor…
☆74Updated this week
onnx / neural-compressor
Model compression for ONNX
☆97Updated 8 months ago
gmalivenko / onnx-opcounter
Count number of parameters / MACs / FLOPS for ONNX models.
☆94Updated 9 months ago
hisrg / SNPE
Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…
☆34Updated 3 years ago
zhenhuaw-me / onnxcli
ONNX Command-Line Toolbox
☆35Updated 9 months ago
jinmingyi1998 / opencl_kernels
An easy way to run, test, benchmark and tune OpenCL kernel files
☆23Updated last year
fumihwh / onnx-pytorch
A code generator from ONNX to PyTorch code
☆138Updated 2 years ago
MegEngine / mgeconvert
MegEngine到其他框架的转换器
☆70Updated 2 years ago
Libraries-Openly-Fused / cvGPUSpeedup
A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!
☆51Updated last week
pytorch / multipy
torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…
☆180Updated 3 weeks ago
jundaf2 / INT8-Flash-Attention-FMHA-Quantization
☆158Updated last year
pytorch-labs / tokenizers
C++ implementations for various tokenizers (sentencepiece, tiktoken etc).
☆34Updated last week
microsoft / onnxconverter-common
Common utilities for ONNX converters
☆276Updated 3 weeks ago
Qualcomm-AI-research / FP8-quantization
☆154Updated 2 years ago
DeMoriarty / custom_matmul_kernels
Customized matrix multiplication kernels
☆56Updated 3 years ago
NVIDIA / sampleQAT
Inference of quantization aware trained networks using TensorRT
☆83Updated 2 years ago
masahi / torchscript-to-tvm
☆69Updated 2 years ago
daquexian / web-model-converter
☆41Updated 2 years ago
tsingmicro-toolchain / OnnxSlim
A Toolkit to Help Optimize Large Onnx Model
☆157Updated last year
ARM-software / kleidiai
This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai
☆62Updated 2 weeks ago
zhenhuaw-me / tflite
Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/
☆102Updated 6 months ago
Oneflow-Inc / OneFlow-Pruning
[CVPR-2023] Towards Any Structural Pruning
☆17Updated 2 years ago
MegEngine / cutlass
CUDA Templates for Linear Algebra Subroutines
☆100Updated last year
fastmachinelearning / qonnx
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
☆151Updated last week
scailable / sclblonnx
Scailable ONNX python tools
☆97Updated 9 months ago
leimao / PyTorch-Quantization-Aware-Training
PyTorch Quantization Aware Training Example
☆138Updated last year
staghado / vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
☆289Updated last year
quic / ai-engine-direct-helper
QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime …
☆59Updated this week
ModelTC / pyvlova
Yet another Polyhedra Compiler for DeepLearning
☆19Updated 2 years ago
microsoft / onnxscript
ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.
☆369Updated this week