Xilinx / brevitas
Brevitas: neural network quantization in PyTorch
☆1,409, updated this week
Alternatives and similar repositories for brevitas
Users who are interested in brevitas are comparing it to the libraries listed below.
- Dataflow compiler for QNN inference on FPGAs ☆890, updated last week
- Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM. ☆451, updated 2 years ago
- An Open-Source Library for Training Binarized Neural Networks ☆718, updated last year
- PyTorch implementation for the APoT quantization (ICLR 2020) ☆277, updated 10 months ago
- QKeras: a quantization deep learning library for Tensorflow Keras ☆576, updated 4 months ago
- Vitis AI is Xilinx’s development stack for AI inference on Xilinx hardware platforms, including both edge devices and Alveo cards. ☆1,660, updated 6 months ago
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision ☆397, updated 4 years ago
- ☆471, updated last year
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ☆161, updated this week
- Low Precision Arithmetic Simulation in PyTorch ☆286, updated last year
- Unofficial implementation of LSQ-Net, a neural network quantization framework ☆303, updated last year
- [ICML'21 Oral] I-BERT: Integer-only BERT Quantization ☆259, updated 2 years ago
- Quantization of Convolutional Neural Networks. ☆243, updated last year
- ☆646, updated 4 years ago
- Model Quantization Benchmark ☆843, updated 6 months ago
- Machine learning on FPGAs using HLS ☆1,670, updated last week
- A simple network quantization demo using PyTorch from scratch. ☆538, updated 2 years ago
- PyTorch Implementation of XNOR-Net ☆495, updated 2 years ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework ☆280, updated last year
- Dataflow QNN inference accelerator examples on FPGAs ☆236, updated 2 months ago
- PyTorch library to facilitate development and standardized evaluation of neural network pruning methods. ☆431, updated 2 years ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction. ☆263, updated 2 years ago
- Summary, Code for Deep Neural Network Quantization ☆554, updated 4 months ago
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment ☆1,934, updated last year
- Binarized Neural Network (BNN) for PyTorch ☆525, updated last year
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices ☆443, updated last year
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures. ☆423, updated last month
- MLPerf® Tiny is an ML benchmark suite for extremely low-power systems such as microcontrollers ☆430, updated 2 months ago
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L… ☆609, updated last year
- A parser, editor and profiler tool for ONNX models. ☆460, updated 2 months ago