Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)
☆27Oct 3, 2023Updated 2 years ago
Alternatives and similar repositories for WinogradAwareNets
Users who are interested in WinogradAwareNets are comparing it to the libraries listed below:
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"☆10Mar 22, 2023Updated 2 years ago
- A Winograd Minimal Filter Implementation in CUDA☆28Aug 25, 2021Updated 4 years ago
- Winograd minimal convolution algorithm generator for convolutional neural networks.☆627Feb 9, 2026Updated last month
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Dec 11, 2020Updated 5 years ago
- Post-training sparsity-aware quantization☆34Feb 26, 2023Updated 3 years ago
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA☆17Jul 7, 2022Updated 3 years ago
- Benchmarking PyTorch 2.0 different models☆20Mar 19, 2023Updated 2 years ago
- Torch Frontend for IREE☆25Dec 21, 2023Updated 2 years ago
- ☆23Updated this week
- Domain-Specific Architecture Generator 2☆22Oct 2, 2022Updated 3 years ago
- UniSparse: An Intermediate Language for General Sparse Format Customization (OOPSLA'24)☆33Nov 12, 2024Updated last year
- BitSplit Post-training Quantization☆50Dec 20, 2021Updated 4 years ago
- Neural network quantization for research and prototyping☆42Updated this week
- PyTorch implementation of BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models☆29Aug 22, 2022Updated 3 years ago
- Lernd is a ∂ILP (dILP) framework implementation based on DeepMind's paper "Learning Explanatory Rules from Noisy Data".☆27Mar 25, 2023Updated 2 years ago
- LogicCircuit is a program that helps build/simulate simple circuits using logic gates. It is meant to teach people the basics of how logi…☆10Feb 16, 2026Updated 3 weeks ago
- ☆28Dec 2, 2024Updated last year
- [ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architecture…☆25Oct 1, 2022Updated 3 years ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow☆170Dec 9, 2019Updated 6 years ago
- [ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark☆115Apr 18, 2023Updated 2 years ago
- FrostNet: Towards Quantization-Aware Network Architecture Search☆105May 3, 2024Updated last year
- My implementation of an FPGA Deep Neural Network Hardware Accelerator, moved from my bitbucket☆28Jul 31, 2019Updated 6 years ago
- GEMM and Winograd based convolutions using CUTLASS☆28Jul 15, 2020Updated 5 years ago
- Implementation of DoReFaNet in TensorFlow on the CIFAR-10 dataset☆28Nov 8, 2017Updated 8 years ago
- ☆27Oct 26, 2019Updated 6 years ago
- [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…☆31Mar 2, 2024Updated 2 years ago
- ☆72Mar 22, 2020Updated 5 years ago
- Implementation of ICLR 2018 paper "Loss-aware Weight Quantization of Deep Networks"☆27Oct 24, 2019Updated 6 years ago
- Open Source Compiler Framework using ONNX as Frontend and IR☆33Aug 17, 2022Updated 3 years ago
- PyTorch implementation for the APoT quantization (ICLR 2020)☆286Dec 11, 2024Updated last year
- PyTorch implementation of FAT: learning low-bitwidth parametric representation via frequency-aware transformation☆27May 2, 2021Updated 4 years ago
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆30Dec 6, 2023Updated 2 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆30Jun 25, 2017Updated 8 years ago
- ☆21Nov 12, 2025Updated 3 months ago
- Cross-platform Python library for macOS, Linux, and Windows. Complete system commands, send alerts and notifications, set brightness, recording au…☆11Apr 25, 2025Updated 10 months ago
- A minimal and interpretable Brian2 based DYNAP neuromorphic processor simulator for educational purposes.☆12Jun 23, 2022Updated 3 years ago
- [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition☆34Oct 11, 2021Updated 4 years ago
- An OpenCL-Based FPGA Accelerator for Compressed YOLOv2☆39May 27, 2021Updated 4 years ago
- Explore the energy-efficient dataflow scheduling for neural networks.☆233Aug 24, 2020Updated 5 years ago