google-ai-edge / ai-edge-quantizer
AI Edge Quantizer: flexible post training quantization for LiteRT models.
☆27Updated this week
Alternatives and similar repositories for ai-edge-quantizer:
Users that are interested in ai-edge-quantizer are comparing it to the libraries listed below
- ☆19Updated 2 weeks ago
- Model compression for ONNX☆87Updated 4 months ago
- Convert tflite to JSON and make it editable in the IDE. It also converts the edited JSON back to tflite binary.☆27Updated 2 years ago
- Visualize ONNX models with model-explorer☆30Updated 2 weeks ago
- Explore training for quantized models☆17Updated 2 months ago
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆16Updated 10 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆39Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆34Updated 2 years ago
- Notes and artifacts from the ONNX steering committee☆25Updated 2 weeks ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆109Updated 3 weeks ago
- AI applications and tools☆26Updated last week
- ONNX implementation of Whisper. PyTorch free.☆92Updated 4 months ago
- Experiments with BitNet inference on CPU☆53Updated 11 months ago
- ☆35Updated this week
- ☆69Updated 2 years ago
- ONNX Script editor & visualiser running completely in the browser thanks to Pyodide and Netron☆20Updated last year
- ☆44Updated 9 months ago
- Profile your CoreML models directly from Python 🐍☆27Updated 5 months ago
- TORCH_LOGS parser for PT2☆35Updated this week
- Repository for ONNX working group artifacts☆24Updated 2 months ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆107Updated 3 months ago
- Extension package of Apache TVM (Machine Learning Compiler) for Renesas DRP-AI accelerators powered by Edgecortix MERA(TM) Based Apache T…☆48Updated last week
- ☆62Updated 3 weeks ago
- cross-platform high speed inference SDK☆34Updated last week
- ☆49Updated last year
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆43Updated 2 weeks ago