google-ai-edge / ai-edge-quantizer
AI Edge Quantizer: flexible post training quantization for LiteRT models.
☆23Updated this week
Alternatives and similar repositories for ai-edge-quantizer:
Users that are interested in ai-edge-quantizer are comparing it to the libraries listed below
- ☆19Updated this week
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆16Updated 9 months ago
- Model compression for ONNX☆84Updated 2 months ago
- Simple tool for partial optimization of ONNX. Further optimize some models that cannot be optimized with onnx-optimizer and onnxsim by se…☆19Updated 9 months ago
- TAO Toolkit deep learning networks with TensorFlow 1.x backend☆13Updated last year
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆107Updated 2 weeks ago
- ONNX and TensorRT implementation of Whisper☆61Updated last year
- 3rd party dependencies for DALI project☆10Updated this week
- Convert tflite to JSON and make it editable in the IDE. It also converts the edited JSON back to tflite binary.☆27Updated last year
- Explore training for quantized models☆15Updated last month
- Open Source Projects from Pallas Lab☆20Updated 3 years ago
- A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible.☆52Updated 2 years ago
- Build TensorFlow Lite runtime with GitHub Actions☆25Updated last year
- Experiments with BitNet inference on CPU☆53Updated 10 months ago
- ☆12Updated this week
- Exports the ONNX file to a JSON file and JSON dict.☆32Updated 2 years ago
- cross-platform high speed inference SDK☆34Updated 3 weeks ago
- ☆21Updated this week
- A tracing JIT for PyTorch☆17Updated 2 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆34Updated 2 years ago
- New operators for the ReferenceEvaluator, new kernels for onnxruntime, CPU, CUDA☆32Updated 4 months ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆136Updated this week
- Simple tool to combine(merge) onnx models. Simple Network Combine Tool for ONNX.☆17Updated 9 months ago
- Test data for DALI project☆41Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆18Updated last week
- ☆31Updated 7 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 2 months ago
- Memory Optimizations for Deep Learning (ICML 2023)☆62Updated 11 months ago