fastmachinelearning / qonnxLinks
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
☆149Updated this week
Alternatives and similar repositories for qonnx
Users that are interested in qonnx are comparing it to the libraries listed below
Sorting:
- Low Precision(quantized) Yolov5☆38Updated 2 months ago
- CSV spreadsheets and other material for AI accelerator survey papers☆169Updated last year
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆87Updated 2 years ago
- Dataflow QNN inference accelerator examples on FPGAs☆217Updated 2 months ago
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆241Updated this week
- ☆149Updated 2 years ago
- ☆99Updated this week
- Torch2Chip (MLSys, 2024)☆51Updated 2 months ago
- Vitis HLS Library for FINN☆197Updated last week
- Accelergy is an energy estimation infrastructure for accelerator energy estimations☆138Updated last week
- A tool to deploy Deep Neural Networks on PULP-based SoC's☆80Updated 3 months ago
- CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture☆143Updated this week
- The Riallto Open Source Project from AMD☆79Updated last month
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆90Updated last year
- ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference☆118Updated 3 months ago
- Machine-Learning Accelerator System Exploration Tools☆166Updated last week
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆110Updated 6 months ago
- Quantized ResNet50 Dataflow Acceleration on Alveo, with PYNQ☆58Updated 3 years ago
- 🧠 Benchmark facility to train networks on different datasets for PyTorch/Brevitas☆26Updated 2 years ago
- PyTorch model to RTL flow for low latency inference☆126Updated last year
- Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts☆120Updated last year
- ☆90Updated last year
- ☆22Updated last year
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing☆338Updated last year
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.☆390Updated 2 months ago
- HW Architecture-Mapping Design Space Exploration Framework for Deep Learning Accelerators☆150Updated 2 months ago
- ☆30Updated 2 years ago
- DPU on PYNQ☆221Updated last year
- muRISCV-NN is a collection of efficient deep learning kernels for embedded platforms and microcontrollers.☆79Updated this week
- [TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers☆43Updated last year