jafermarq / WinogradAwareNets
Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)
☆27Updated last year
Alternatives and similar repositories for WinogradAwareNets:
Users that are interested in WinogradAwareNets are comparing it to the libraries listed below
- Simulator for BitFusion☆97Updated 4 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆40Updated 4 years ago
- Post-training sparsity-aware quantization☆34Updated 2 years ago
- ProxQuant: Quantized Neural Networks via Proximal Operators☆29Updated 6 years ago
- ☆34Updated 4 years ago
- ☆33Updated 3 years ago
- ☆70Updated 5 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆95Updated 3 years ago
- ☆36Updated 6 years ago
- Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv☆80Updated 2 years ago
- PyTorch implementation of EdMIPS: https://arxiv.org/pdf/2004.05795.pdf☆58Updated 4 years ago
- ☆39Updated 2 years ago
- A Out-of-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralz…☆26Updated 2 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆15Updated 3 years ago
- ☆18Updated 3 years ago
- [ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architecture…☆23Updated 2 years ago
- Approximate layers - TensorFlow extension☆27Updated this week
- Neural Network Quantization With Fractional Bit-widths☆12Updated 4 years ago
- Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?"☆19Updated last year
- Tool for optimize CNN blocking☆94Updated 5 years ago
- Provides the code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators" by Luk…☆17Updated 5 years ago
- ☆14Updated 5 years ago
- Torch-7 implementation of BinaryDuo (ICLR 2020).☆9Updated 4 years ago
- ☆19Updated 4 years ago
- [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…☆30Updated last year
- ☆76Updated 2 years ago
- BitSplit Post-trining Quantization☆49Updated 3 years ago
- TQT's pytorch implementation.☆21Updated 3 years ago
- DAC System Design Contest 2020☆29Updated 4 years ago
- BlockCIrculantRNN (LSTM and GRU) using TensorFlow☆14Updated 6 years ago