wangmaolin / niti
Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arXiv
☆77 Updated 2 years ago
Related projects
Alternatives and complementary repositories for niti
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20) ☆27 Updated last year
- Post-training sparsity-aware quantization ☆33 Updated last year
- ☆68 Updated 2 years ago
- Implementation for the paper "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization" ☆73 Updated 4 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming ☆95 Updated 3 years ago
- Simulator for BitFusion ☆92 Updated 4 years ago
- BNNs (XNOR, BNN and DoReFa) implementation for PyTorch 1.0+ ☆39 Updated last year
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021) ☆36 Updated 3 years ago
- Approximate layers - TensorFlow extension ☆26 Updated 7 months ago
- A collection of research papers on efficient training of DNNs ☆68 Updated 2 years ago
- ☆36 Updated 5 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization ☆95 Updated 2 years ago
- Low Precision Arithmetic Simulation in PyTorch ☆265 Updated 6 months ago
- Reproduction of WAGE in PyTorch. ☆41 Updated 6 years ago
- Binarize convolutional neural networks using PyTorch ☆134 Updated 2 years ago
- DNN quantization with outlier channel splitting ☆112 Updated 4 years ago
- Open-source artifacts and codes of our MICRO'23 paper titled “Sparse-DySta: Sparsity-Aware Dynamic and Static Scheduling for Sparse Multi… ☆32 Updated last year
- PyTorch fixed-point training tool/framework ☆34 Updated 4 years ago
- This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr… ☆49 Updated 6 months ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming ☆32 Updated last year
- ☆69 Updated 4 years ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware. ☆100 Updated 11 months ago
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi… ☆25 Updated last year
- Accelergy is an energy estimation infrastructure for accelerator energy estimations ☆126 Updated 2 months ago
- [ICML 2021] "Double-Win Quant: Aggressively Winning Robustness of Quantized Deep Neural Networks via Random Precision Training and Inferen… ☆13 Updated 2 years ago
- BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing ☆131 Updated 4 years ago
- Measuring and predicting on-device metrics (latency, power, etc.) of machine learning models ☆66 Updated last year
- The codes and artifacts associated with our MICRO'22 paper titled: "Adaptable Butterfly Accelerator for Attention-based NNs via Hardware … ☆113 Updated last year
- AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao. ☆11 Updated 3 years ago
- Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop ☆46 Updated 2 weeks ago