taehokim20 / CPrune
CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution
☆16Updated last year
Alternatives and similar repositories for CPrune:
Users that are interested in CPrune are comparing it to the libraries listed below
- Post-training sparsity-aware quantization☆34Updated last year
- [ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"☆32Updated 2 years ago
- TQT's pytorch implementation.☆21Updated 3 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation☆27Updated 5 years ago
- BitSplit Post-trining Quantization☆48Updated 3 years ago
- Neural Network Quantization With Fractional Bit-widths☆12Updated 4 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)☆27Updated last year
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆38Updated 4 years ago
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi…☆25Updated 2 years ago
- Fast NPU-aware Neural Architecture Search☆22Updated 3 years ago
- Benchmark PyTorch Custom Operators☆13Updated last year
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆96Updated 3 years ago
- A Out-of-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralz…☆26Updated 2 years ago
- This is the implementation for paper: AdaTune: Adaptive Tensor Program CompilationMade Efficient (NeurIPS 2020).☆13Updated 3 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Updated 3 years ago
- A 8-/16-/32-/64-bit floating point number family☆17Updated 3 years ago
- ☆34Updated 2 years ago
- ☆69Updated last year
- This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆49Updated 9 months ago
- ☆17Updated 4 years ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆105Updated 2 months ago
- Benchmark scripts for TVM☆73Updated 2 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆19Updated last year
- ☆30Updated last year
- This is an implementation of YOLO using LSQ network quantization method.☆22Updated 2 years ago
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆50Updated 2 years ago
- ☆15Updated 3 years ago
- GoldenEye is a functional simulator with fault injection capabilities for common and emerging numerical formats, implemented for the PyTo…☆22Updated 3 months ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆34Updated last year
- ☆36Updated 2 years ago