jafermarq / WinogradAwareNets
Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)
☆27Updated last year
Related projects ⓘ
Alternatives and complementary repositories for WinogradAwareNets
- Simulator for BitFusion☆92Updated 4 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆36Updated 3 years ago
- Post-training sparsity-aware quantization☆33Updated last year
- A Out-of-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralz…☆26Updated last year
- This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆49Updated 6 months ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆95Updated 3 years ago
- ☆30Updated 4 years ago
- ☆69Updated 4 years ago
- Neural Network Quantization With Fractional Bit-widths☆12Updated 3 years ago
- ☆31Updated 3 years ago
- Approximate layers - TensorFlow extension☆26Updated 7 months ago
- ☆68Updated 2 years ago
- ☆36Updated 5 years ago
- DNN quantization with outlier channel splitting☆112Updated 4 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆15Updated 2 years ago
- Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop☆46Updated 2 weeks ago
- GoldenEye is a functional simulator with fault injection capabilities for common and emerging numerical formats, implemented for the PyTo…☆22Updated last month
- Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)☆12Updated 3 years ago
- Conditional channel- and precision-pruning on neural networks☆72Updated 4 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆32Updated last year
- PyTorch implementation of EdMIPS: https://arxiv.org/pdf/2004.05795.pdf☆57Updated 4 years ago
- ☆38Updated last year
- mixed-precision quantization for LLMs☆14Updated last year
- ☆34Updated 4 months ago
- [ICLR 2021] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vi…☆30Updated 8 months ago
- ☆19Updated 3 years ago
- ☆18Updated 2 years ago
- RTL implementation of Flex-DPE.☆91Updated 4 years ago
- ☆27Updated 4 years ago
- Provides the code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators" by Luk…☆17Updated 5 years ago