PyTorch Quantization Aware Training Example
☆150May 18, 2024Updated last year
Alternatives and similar repositories for PyTorch-Quantization-Aware-Training
Users that are interested in PyTorch-Quantization-Aware-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Static Quantization Example☆41Apr 29, 2021Updated 5 years ago
- A simple network quantization demo using pytorch from scratch.☆542Jun 18, 2023Updated 2 years ago
- PyTorch Pruning Example☆53Dec 5, 2022Updated 3 years ago
- Manually implemented quantization-aware training☆23Oct 12, 2022Updated 3 years ago
- ☆16May 3, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Brevitas: neural network quantization in PyTorch☆1,524Apr 23, 2026Updated last week
- Revisit Kernel Pruning with Lottery Regulated Grouped Convolutions. ICLR 2022☆11Nov 24, 2022Updated 3 years ago
- ☆14Nov 28, 2022Updated 3 years ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,271May 6, 2025Updated 11 months ago
- electron、vue3、vite、ts、element-plus、vue-router、eslint、prettier☆10Nov 16, 2021Updated 4 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- Delay estimation logic extracted from WebRTC☆18Jan 11, 2021Updated 5 years ago
- ☆27Aug 5, 2022Updated 3 years ago
- Team <skyb> solution for the AIM2020 mobile image signal processing challenge☆16Mar 15, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Transformation process of a Python Pytorch GPU model into an optimized TensorRT C++ one.☆13Mar 8, 2021Updated 5 years ago
- NanoDet for Jetson Nano☆11Sep 30, 2023Updated 2 years ago
- ONNX Runtime Inference C++ Example☆260Apr 3, 2025Updated last year
- yolov5第四版☆15Oct 13, 2021Updated 4 years ago
- For CPU experiment☆14Feb 23, 2021Updated 5 years ago
- 使用RV1126部署YOLOv5模型☆15May 23, 2024Updated last year
- A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are co…☆2,360Apr 25, 2026Updated last week
- ☆210Nov 9, 2021Updated 4 years ago
- fast, lightweight dbscan implementation for peptide strings☆12Apr 29, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Quantize,Pytorch,Vgg16,MobileNet☆44Jan 29, 2021Updated 5 years ago
- these days I have downed l lot of papers about action recognition,all of them from cvpr/iccv/nips and so on☆16Sep 12, 2018Updated 7 years ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,604Apr 24, 2026Updated last week
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.☆3,299Sep 7, 2025Updated 7 months ago
- 将MNN拆解的简易前向推理框架(for study!)☆24Feb 21, 2021Updated 5 years ago
- TensorRT-in-Action 是一个 GitHub 代码库,提供了使用 TensorRT 的代码示例,并有对应 Jupyter Notebook。☆15Jun 1, 2023Updated 2 years ago
- Pytorch implementation of our paper accepted by ECCV2022 -- Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networ…☆30Sep 13, 2022Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆36Jun 29, 2023Updated 2 years ago
- Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"☆324Mar 4, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- I'm going to use the Winograd’s minimal filtering algorithms to introduce a new class of fast algorithms for convolutional neural networks…☆12Mar 22, 2018Updated 8 years ago
- Model Quantization Benchmark☆866Apr 20, 2025Updated last year
- PyTorch implementation of 'Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding' by …☆427Feb 27, 2020Updated 6 years ago
- Inference of quantization aware trained networks using TensorRT☆85Jan 27, 2023Updated 3 years ago
- ipython notebooks for feature extraction and training of audio event classifier on ESC-50 dataset.☆10Mar 16, 2018Updated 8 years ago
- Python Speex☆24Aug 10, 2017Updated 8 years ago
- ☆52Jan 2, 2021Updated 5 years ago