jafermarq / WinogradAwareNets
Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)
☆27 · Updated last year
Alternatives and similar repositories for WinogradAwareNets:
Users who are interested in WinogradAwareNets are comparing it to the libraries listed below.
- Simulator for BitFusion ☆97 · Updated 4 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021) ☆40 · Updated 4 years ago
- Post-training sparsity-aware quantization ☆34 · Updated 2 years ago
- Approximate layers - TensorFlow extension ☆27 · Updated 10 months ago
- Provides the code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators" by Luk… ☆17 · Updated 5 years ago
- ☆32 · Updated 4 years ago
- ☆70 · Updated 4 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga… ☆15 · Updated 3 years ago
- ☆19 · Updated 4 years ago
- ☆18 · Updated 3 years ago
- ☆32 · Updated 3 years ago
- An out-of-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralz… ☆26 · Updated 2 years ago
- Training with Block Minifloat number representation ☆14 · Updated 3 years ago
- [ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architecture… ☆23 · Updated 2 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming ☆96 · Updated 3 years ago
- Neural Network Quantization With Fractional Bit-widths ☆12 · Updated 4 years ago
- TQT's pytorch implementation. ☆21 · Updated 3 years ago
- ☆14 · Updated 5 years ago
- ☆27 · Updated 4 years ago
- Torch-7 implementation of BinaryDuo (ICLR 2020). ☆9 · Updated 4 years ago
- Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?" ☆19 · Updated last year
- ☆19 · Updated last month
- ☆39 · Updated 8 months ago
- A general framework for optimizing DNN dataflow on systolic array ☆34 · Updated 4 years ago
- Tool for optimizing CNN blocking ☆94 · Updated 4 years ago
- ☆71 · Updated 2 years ago
- A framework for fast exploration of the depth-first scheduling space for DNN accelerators ☆37 · Updated 2 years ago
- ☆26 · Updated 3 months ago
- Designs for finalist teams of the DAC System Design Contest ☆36 · Updated 4 years ago
- A reference implementation of the Mind Mappings Framework. ☆29 · Updated 3 years ago