leimao / LibTorch-ResNet-CIFAR
ResNet Implementation, Training, and Inference Using LibTorch C++ API
☆34 Updated 3 months ago
Related projects:
- PyTorch Quantization Aware Training Example ☆119 Updated 4 months ago
- Swin Transformer C++ Implementation ☆53 Updated 3 years ago
- PyTorch -> ONNX -> TVM for autotuning ☆24 Updated 4 years ago
- PyTorch Pruning Example ☆46 Updated last year
- Inference of quantization aware trained networks using TensorRT ☆77 Updated last year
- An Open Convolutional Neural Network Framework in C++ From Scratch ☆57 Updated 3 years ago
- Quantization-aware training package for NCNN on PyTorch ☆68 Updated 3 years ago
- Manually implemented quantization-aware training ☆21 Updated last year
- CUDA Templates for Linear Algebra Subroutines ☆90 Updated 4 months ago
- Count number of parameters / MACs / FLOPS for ONNX models. ☆84 Updated 2 years ago
- This is an implementation of YOLO using the LSQ network quantization method. ☆18 Updated 2 years ago
- FakeQuantize with Learned Step Size (LSQ+) as Observer in PyTorch ☆32 Updated 2 years ago
- ONNX Command-Line Toolbox ☆35 Updated last year
- Fast NPU-aware Neural Architecture Search ☆21 Updated 3 years ago
- PyTorch implementation of Near-Lossless Post-Training Quantization of Deep Neural Networks via a Piecewise Linear Approximation ☆21 Updated 4 years ago
- Sandbox for TVM and playing around! ☆22 Updated last year
- Benchmark inference speed of CNNs with various quantization methods in PyTorch + TensorRT with Jetson Nano/Xavier ☆54 Updated last year
- Scailable ONNX Python tools ☆96 Updated this week
- Tencent NCNN with added CUDA support ☆67 Updated 3 years ago
- Code for "Fast Sparse ConvNets" CVPR2020 submissions ☆13 Updated 4 years ago
- Batch Normalization Auto-fusion for PyTorch ☆32 Updated 4 years ago
- ☆52 Updated 3 years ago
- PyTorch Static Quantization Example ☆39 Updated 3 years ago
- ☆66 Updated last year
- Implementation of the Winograd algorithm. ☆20 Updated 5 years ago
- A tool to convert a TensorRT engine/plan to a fake ONNX ☆37 Updated last year
- [CVPR-2023] Towards Any Structural Pruning ☆17 Updated last year
- Arch-Net: Model Distillation for Architecture Agnostic Model Deployment ☆22 Updated 2 years ago
- Inference Server Implementation from Scratch for Machine Learning Models ☆23 Updated 3 years ago
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API ☆22 Updated last year