QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
☆178Feb 19, 2026Updated last week
Alternatives and similar repositories for qonnx
Users that are interested in qonnx are comparing it to the libraries listed below
Sorting:
- Brevitas: neural network quantization in PyTorch☆1,488Updated this week
- Dataflow compiler for QNN inference on FPGAs☆945Updated this week
- Dataflow QNN inference accelerator examples on FPGAs☆244Aug 26, 2025Updated 6 months ago
- High Granularity Quantizarion for Ultra-Fast Machine Learning Applications on FPGAs☆39Jul 23, 2025Updated 7 months ago
- Resource Utilization and Latency Estimation for ML on FPGA.☆18Feb 4, 2026Updated 3 weeks ago
- Open Source Compiler Framework using ONNX as Frontend and IR☆33Aug 17, 2022Updated 3 years ago
- Live demo of hls4ml on embedded platforms such as the Pynq-Z2☆12Aug 23, 2024Updated last year
- Vitis HLS Library for FINN☆215Updated this week
- Machine learning on FPGAs using HLS☆1,812Updated this week
- Fast inference of Boosted Decision Trees in FPGAs☆57Jan 28, 2026Updated last month
- ☆25Sep 19, 2025Updated 5 months ago
- ☆79Jul 21, 2022Updated 3 years ago
- An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.☆14Feb 3, 2025Updated last year
- Tutorial notebooks for hls4ml☆407Feb 23, 2026Updated last week
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆453May 15, 2023Updated 2 years ago
- New operators for the ReferenceEvaluator, new kernels for onnxruntime, CPU, CUDA☆35Feb 13, 2026Updated 2 weeks ago
- Low Precision(quantized) Yolov5☆47Mar 24, 2025Updated 11 months ago
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆429Updated this week
- Generate versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.☆16Dec 29, 2024Updated last year
- ☆169Mar 9, 2023Updated 2 years ago
- ☆28Oct 21, 2020Updated 5 years ago
- TQT's pytorch implementation.☆21Dec 17, 2021Updated 4 years ago
- SAMO: Streaming Architecture Mapping Optimisation☆34Oct 4, 2023Updated 2 years ago
- Example for applying Gaussian and Laplace clipping on activations of CNN.☆34Jan 20, 2019Updated 7 years ago
- A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible.☆52Sep 8, 2022Updated 3 years ago
- Train and deploy LUT-based neural networks on FPGAs☆107Jun 12, 2024Updated last year
- PyTorch implementation for the APoT quantization (ICLR 2020)☆283Dec 11, 2024Updated last year
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆98Jun 10, 2021Updated 4 years ago
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆431Updated this week
- ☆37Jun 1, 2022Updated 3 years ago
- 🧠 Benchmark facility to train networks on different datasets for PyTorch/Brevitas☆27Jan 27, 2023Updated 3 years ago
- Model compression for ONNX☆100Feb 19, 2026Updated last week
- PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)☆18Jun 22, 2022Updated 3 years ago
- Pytorch implementation of BRECQ, ICLR 2021☆290Aug 1, 2021Updated 4 years ago
- A tool for parsing, editing, optimizing, and profiling ONNX models.☆480Feb 10, 2026Updated 2 weeks ago
- ONNX Optimizer☆797Feb 4, 2026Updated 3 weeks ago
- Build TensorFlow Lite runtime with GitHub Actions☆27Jul 25, 2025Updated 7 months ago
- ☆31Nov 7, 2024Updated last year
- PyTorch model to RTL flow for low latency inference☆131Mar 15, 2024Updated last year