fastmachinelearning / qonnx
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
☆148Updated this week
Alternatives and similar repositories for qonnx
Users that are interested in qonnx are comparing it to the libraries listed below
Sorting:
- Low Precision(quantized) Yolov5☆37Updated last month
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆86Updated 2 years ago
- Torch2Chip (MLSys, 2024)☆51Updated last month
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆110Updated 5 months ago
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆224Updated 3 weeks ago
- Machine-Learning Accelerator System Exploration Tools☆161Updated 2 weeks ago
- ☆146Updated 2 years ago
- The Riallto Open Source Project from AMD☆77Updated last month
- Dataflow QNN inference accelerator examples on FPGAs☆213Updated last month
- Accelergy is an energy estimation infrastructure for accelerator energy estimations☆136Updated 2 months ago
- PyTorch model to RTL flow for low latency inference☆126Updated last year
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆85Updated 11 months ago
- ☆95Updated this week
- Open Source Compiler Framework using ONNX as Frontend and IR☆30Updated 2 years ago
- A tool to deploy Deep Neural Networks on PULP-based SoC's☆80Updated 2 months ago
- Vitis HLS Library for FINN☆193Updated this week
- CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture☆142Updated this week
- Approximate layers - TensorFlow extension☆27Updated 3 weeks ago
- CSV spreadsheets and other material for AI accelerator survey papers☆167Updated last year
- Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts☆118Updated last year
- 🧠 Benchmark facility to train networks on different datasets for PyTorch/Brevitas☆25Updated 2 years ago
- Quantized ResNet50 Dataflow Acceleration on Alveo, with PYNQ☆58Updated 3 years ago
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.☆381Updated last month
- ☆89Updated last year
- The codes and artifacts associated with our MICRO'22 paper titled: "Adaptable Butterfly Accelerator for Attention-based NNs via Hardware …☆131Updated last year
- Open, Modular, Deep Learning Accelerator☆286Updated last year
- A survey on Hardware Accelerated LLMs☆51Updated 4 months ago
- ☆30Updated 2 years ago
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing☆338Updated last year
- SAMO: Streaming Architecture Mapping Optimisation☆32Updated last year