NVIDIA / sampleQATLinks

Inference of quantization aware trained networks using TensorRT

☆83

Alternatives and similar repositories for sampleQAT

Users that are interested in sampleQAT are comparing it to the libraries listed below

Sorting:

gmalivenko / onnx-opcounter
Count number of parameters / MACs / FLOPS for ONNX models.
☆95Updated last year
leimao / PyTorch-Quantization-Aware-Training
PyTorch Quantization Aware Training Example
☆145Updated last year
ModelTC / Dipoorlet
Offline Quantization Tools for Deploy.
☆141Updated last year
jakc4103 / DFQ
PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.
☆263Updated 2 years ago
DeadAt0m / LSQFakeQuantize-PyTorch
FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch
☆36Updated 3 years ago
AI-performance / embedded-ai.bench
benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.
☆204Updated 4 years ago
masahi / torchscript-to-tvm
☆68Updated 2 years ago
ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆468Updated last month
leimao / PyTorch-Static-Quantization
PyTorch Static Quantization Example
☆38Updated 4 years ago
itayhubara / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆98Updated 4 years ago
FrozenGene / tvm-tutorial
TVM tutorial
☆66Updated 6 years ago
fumihwh / onnx-pytorch
A code generator from ONNX to PyTorch code
☆141Updated 3 years ago
vinx13 / tvm-cuda-int8-benchmark
Benchmark of TVM quantized model on CUDA
☆111Updated 5 years ago
ModelTC / NNLQP
☆36Updated 3 years ago
A-suozhang / awesome-quantization-and-fixed-point-training
Neural Network Quantization & Low-Bit Fixed Point Training For Hardware-Friendly Algorithm Design
☆160Updated 4 years ago
MegEngine / examples
A set of examples around MegEngine
☆31Updated last year
tlc-pack / TLCBench
Benchmark scripts for TVM
☆74Updated 3 years ago
ModelTC / mqbench-paper
☆44Updated 4 years ago
deepglint / EasyQuant
EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…
☆404Updated 3 years ago
aojunzz / NM-sparsity
☆243Updated 3 years ago
tlc-pack / tophub
tophub autotvm log collections
☆69Updated 2 years ago
ChenShisen / ncnnqat
quantize aware training package for NCNN on pytorch
☆69Updated 4 years ago
TrojanXu / onnxparser-trt-plugin-sample
A sample for onnxparser working with trt user defined plugins for TRT7.0
☆170Updated 5 years ago
Xilinx / graffitist
Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow
☆168Updated 5 years ago
Qualcomm-AI-research / FP8-quantization
☆166Updated 2 years ago
yhhhli / BRECQ
Pytorch implementation of BRECQ, ICLR 2021
☆284Updated 4 years ago
ModelTC / NART
NART = NART is not A RunTime, a deep learning inference framework.
☆37Updated 2 years ago
EunhyeokPark / PROFIT
☆49Updated 3 years ago
Qualcomm-AI-research / transformer-quantization
☆207Updated 4 years ago
BBuf / how-to-optimize-gemm
☆98Updated 4 years ago