PyTorch Quantization Aware Training Example
☆150May 18, 2024Updated 2 years ago
Alternatives and similar repositories for PyTorch-Quantization-Aware-Training
Users that are interested in PyTorch-Quantization-Aware-Training are comparing it to the libraries listed below.
- PyTorch Static Quantization Example☆41Apr 29, 2021Updated 5 years ago
- A simple network quantization demo using pytorch from scratch.☆541Jun 18, 2023Updated 3 years ago
- Manually implemented quantization-aware training☆23Oct 12, 2022Updated 3 years ago
- ☆16May 3, 2024Updated 2 years ago
- Brevitas: neural network quantization in PyTorch☆1,543Jun 23, 2026Updated last week
- Revisit Kernel Pruning with Lottery Regulated Grouped Convolutions. ICLR 2022☆11Nov 24, 2022Updated 3 years ago
- ☆14Nov 28, 2022Updated 3 years ago
- micronet, a model compression and deploy lib. compression: 1. quantization: quantization-aware-training (QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,267May 6, 2025Updated last year
- Delay estimation logic extracted from WebRTC☆18Jan 11, 2021Updated 5 years ago
- ☆27Aug 5, 2022Updated 3 years ago
- Team <skyb> solution for the AIM2020 mobile image signal processing challenge☆17Mar 15, 2021Updated 5 years ago
- Unofficial implementation of LSQ-Net, a neural network quantization framework☆315May 8, 2024Updated 2 years ago
- NanoDet for Jetson Nano☆11Sep 30, 2023Updated 2 years ago
- ONNX Runtime Inference C++ Example☆262Apr 3, 2025Updated last year
- YOLOv5, version 4☆15Oct 13, 2021Updated 4 years ago
- Train cifar10 networks and inference with tensorrt.☆16Apr 16, 2020Updated 6 years ago
- Multiple-variance Volterra series Identification Tool☆16Sep 25, 2021Updated 4 years ago
- Deploying a YOLOv5 model on the RV1126☆16May 23, 2024Updated 2 years ago
- A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research; we are co…☆2,397May 11, 2026Updated last month
- PyTorch implementation for the APoT quantization (ICLR 2020)☆288Dec 11, 2024Updated last year
- ☆212Nov 9, 2021Updated 4 years ago
- fast, lightweight dbscan implementation for peptide strings☆12Apr 29, 2020Updated 6 years ago
- Quantize, PyTorch, VGG16, MobileNet☆44Jan 29, 2021Updated 5 years ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,647Updated this week
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.☆3,322Sep 7, 2025Updated 9 months ago
- A simple forward-inference framework made by taking apart MNN (for study!)☆24Feb 21, 2021Updated 5 years ago
- Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".☆37Jul 19, 2020Updated 5 years ago
- TensorRT-in-Action is a GitHub repository that provides code examples for using TensorRT, with accompanying Jupyter Notebooks.☆15Jun 1, 2023Updated 3 years ago
- Pytorch implementation of our paper accepted by ECCV2022 -- Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networ…☆30Sep 13, 2022Updated 3 years ago
- Complex Neural Beamformer☆33Oct 15, 2020Updated 5 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆36Jun 29, 2023Updated 3 years ago
- Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"☆326Mar 4, 2025Updated last year
- PercepNet implemented using Keras; still needs to be optimized and tuned.☆39Jul 23, 2021Updated 4 years ago
- I'm going to use the Winograd’s minimal filtering algorithms to introduce a new class of fast algorithms for convolutional neural networks…☆12Mar 22, 2018Updated 8 years ago
- Model Quantization Benchmark☆868Apr 20, 2025Updated last year
- PyTorch Quantization Aware Training (QAT)☆44Oct 13, 2023Updated 2 years ago
- PyTorch implementation of 'Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding' by …☆429Feb 27, 2020Updated 6 years ago
- Inference of quantization aware trained networks using TensorRT☆86Jan 27, 2023Updated 3 years ago
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆40May 4, 2026Updated last month