Implementation of ICLR 2018 paper "Loss-aware Weight Quantization of Deep Networks"
☆27Oct 24, 2019Updated 6 years ago
Alternatives and similar repositories for Loss-aware-weight-quantization
Users that are interested in Loss-aware-weight-quantization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of ICLR 2017 paper "Loss-aware Binarization of Deep Networks"☆20Feb 24, 2019Updated 7 years ago
- ProxQuant: Quantized Neural Networks via Proximal Operators☆30Feb 19, 2019Updated 7 years ago
- Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020☆21Nov 15, 2020Updated 5 years ago
- Pytorch implementation for FAT: learning low-bitwidth parametric representation via frequency-aware transformation☆66May 2, 2021Updated 5 years ago
- source code of the paper: Robust Quantization: One Model to Rule Them All☆40Mar 24, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Codes for accepted paper "Cooperative Pruning in Cross-Domain Deep Neural Network Compression" in IJCAI 2019.☆12Aug 15, 2019Updated 6 years ago
- Codes for Accepted Paper : "MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization" in NeurIPS 2019☆54May 8, 2020Updated 5 years ago
- ☆19Mar 16, 2022Updated 4 years ago
- Any-Precision Deep Neural Networks (AAAI 2021)☆62May 2, 2020Updated 6 years ago
- ☆14Feb 7, 2020Updated 6 years ago
- DNN quantization with outlier channel splitting (ICML'19)☆114Mar 21, 2020Updated 6 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆16Jan 5, 2021Updated 5 years ago
- ☆11Jan 10, 2025Updated last year
- This script is for photographers to do timeslice with one click.☆13Aug 21, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Jul 25, 2024Updated last year
- PyTorch implementation of Towards Efficient Training for Neural Network Quantization☆16Jan 16, 2020Updated 6 years ago
- ACL 2023☆39Jun 6, 2023Updated 2 years ago
- Efficient Sparse-Winograd Convolutional Neural Networks (ICLR 2018)☆192May 7, 2019Updated 6 years ago
- Implementation of NeurIPS 2019 paper "Normalization Helps Training of Quantized LSTM"☆31Jul 25, 2024Updated last year
- Training Quantized Neural Networks with a Full-precision Auxiliary Module☆13Jun 19, 2020Updated 5 years ago
- Learning to share: simultaneous parameter tying and sparsification in deep learning☆13Aug 21, 2018Updated 7 years ago
- LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks☆245Aug 30, 2022Updated 3 years ago
- Codes for Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?☆31Oct 7, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A PyTorch implementation of MixNet: Mixed Depthwise Convolutional Kernels☆11Aug 5, 2019Updated 6 years ago
- Using fixed-point arithmetic in a modern FPGA to produce cool sounds by modeling a 1970s-era Moog-like synthesizer.☆22Dec 2, 2018Updated 7 years ago
- Implements quantized distillation. Code for our paper "Model compression via distillation and quantization"☆336Jul 25, 2024Updated last year
- pytorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"☆129Jan 2, 2020Updated 6 years ago
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆41Sep 9, 2025Updated 7 months ago
- PyTorch Implementation of XNOR-Net☆493Feb 27, 2023Updated 3 years ago
- Adaptive Stochastic Natural Gradient Method for One-Shot Neural Architecture Search☆89May 29, 2019Updated 6 years ago
- Unofficial Scalable-Softmax Is Superior for Attention☆20May 30, 2025Updated 11 months ago
- ☆14Oct 24, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Neural Network Quantization With Fractional Bit-widths☆11Feb 19, 2021Updated 5 years ago
- Code for “Discrimination-aware-Channel-Pruning-for-Deep-Neural-Networks”☆184Oct 29, 2020Updated 5 years ago
- Implement Towards Effective Low-bitwidth Convolutional Neural Networks☆41Sep 17, 2018Updated 7 years ago
- This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆51May 9, 2024Updated last year
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆98Jun 10, 2021Updated 4 years ago
- [ICML'21 Oral] I-BERT: Integer-only BERT Quantization☆268Jan 29, 2023Updated 3 years ago
- Implementation for Trained Ternary Network.☆108Jan 13, 2017Updated 9 years ago