fastmachinelearning / qonnx
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
☆125Updated last week
Related projects ⓘ
Alternatives and complementary repositories for qonnx
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆162Updated last month
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆100Updated 11 months ago
- Low Precision(quantized) Yolov5☆31Updated 9 months ago
- ☆121Updated last year
- The Riallto Open Source Project from AMD☆68Updated last week
- Dataflow QNN inference accelerator examples on FPGAs☆181Updated this week
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆80Updated last year
- Torch2Chip (MLSys, 2024)☆50Updated 2 months ago
- Vitis HLS Library for FINN☆178Updated 2 weeks ago
- Open Source Compiler Framework using ONNX as Frontend and IR☆29Updated 2 years ago
- PyTorch model to RTL flow for low latency inference☆121Updated 7 months ago
- Fast inference of Boosted Decision Trees in FPGAs☆48Updated 2 months ago
- Accelergy is an energy estimation infrastructure for accelerator energy estimations☆126Updated 2 months ago
- HW Architecture-Mapping Design Space Exploration Framework for Deep Learning Accelerators☆113Updated this week
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆53Updated 5 months ago
- 🧠 Benchmark facility to train networks on different datasets for PyTorch/Brevitas☆24Updated last year
- A tool to deploy Deep Neural Networks on PULP-based SoC's☆77Updated 7 months ago
- ☆79Updated this week
- Repository to host and maintain scale-sim-v2 code☆232Updated this week
- muRISCV-NN is a collection of efficient deep learning kernels for embedded platforms and microcontrollers.☆62Updated last week
- The codes and artifacts associated with our MICRO'22 paper titled: "Adaptable Butterfly Accelerator for Attention-based NNs via Hardware …☆110Updated last year
- Open, Modular, Deep Learning Accelerator☆254Updated 7 months ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆95Updated 3 years ago
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆305Updated this week
- Research and Materials on Hardware implementation of Transformer Model☆205Updated this week
- ☆82Updated 4 months ago
- IREE plugin repository for the AMD AIE accelerator☆66Updated this week
- A scalable High-Level Synthesis framework on MLIR☆226Updated 5 months ago
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.☆336Updated this week
- SAMO: Streaming Architecture Mapping Optimisation☆32Updated last year