microsoft / nn-MeterLinks

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

☆357

Alternatives and similar repositories for nn-Meter

Users that are interested in nn-Meter are comparing it to the libraries listed below

Sorting:

Zhen-Dong / HAWQ
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
☆443Updated 2 years ago
mit-han-lab / haq
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
☆392Updated 4 years ago
jakc4103 / DFQ
PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.
☆262Updated last year
amirgholami / ZeroQ
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
☆280Updated last year
mit-han-lab / inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
☆200Updated 3 years ago
submission2019 / cnn-quantization
Quantization of Convolutional Neural networks.
☆244Updated last year
yhhhli / APoT_Quantization
PyTorch implementation for the APoT quantization (ICLR 2020)
☆277Updated 7 months ago
aojunzz / NM-sparsity
☆236Updated 2 years ago
yhhhli / BRECQ
Pytorch implementation of BRECQ, ICLR 2021
☆282Updated 4 years ago
NVIDIA / sampleQAT
Inference of quantization aware trained networks using TensorRT
☆83Updated 2 years ago
mit-han-lab / amc
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
☆443Updated last year
ModelTC / NNLQP
☆36Updated 3 years ago
ModelTC / MQBench
Model Quantization Benchmark
☆827Updated 3 months ago
A-suozhang / awesome-quantization-and-fixed-point-training
Neural Network Quantization & Low-Bit Fixed Point Training For Hardware-Friendly Algorithm Design
☆161Updated 4 years ago
ucbrise / actnn
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training
☆200Updated 2 years ago
csyhhu / Awesome-Deep-Neural-Network-Compression
Summary, Code for Deep Neural Network Quantization
☆552Updated last month
GATECH-EIC / HW-NAS-Bench
[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark
☆111Updated 2 years ago
csu-eis / CoDL
☆77Updated 2 years ago
itayhubara / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆99Updated 4 years ago
Qualcomm-AI-research / transformer-quantization
☆205Updated 3 years ago
cedrickchee / awesome-ml-model-compression
Awesome machine learning model compression research papers, quantization, tools, and learning material.
☆527Updated 10 months ago
deepglint / EasyQuant
EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…
☆402Updated 2 years ago
Qualcomm-AI-research / FP8-quantization
☆154Updated 2 years ago
megvii-research / FQ-ViT
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
☆347Updated 2 years ago
Kyrie-Zhao / awesome-real-time-AI
This is a list of awesome edgeAI inference related papers.
☆97Updated last year
mit-han-lab / apq
[CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
☆159Updated 5 years ago
microsoft / nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆991Updated 10 months ago
learning1234embed / NeuralWeightVirtualization
[MobiSys 2020] Fast and Scalable In-memory Deep Multitask Learning via Neural Weight Virtualization
☆15Updated 5 years ago
zhutmost / lsq-net
Unofficial implementation of LSQ-Net, a neural network quantization framework
☆299Updated last year
walkerning / aw_nas
aw_nas: A Modularized and Extensible NAS Framework
☆250Updated last year