moranshkolnik/RobustQuantization

Source code for the paper "Robust Quantization: One Model to Rule Them All"

☆42

Alternatives and similar repositories for RobustQuantization

Users who are interested in RobustQuantization are comparing it to the libraries listed below.

ModelTC / Outlier_Suppression_Plus
Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…
☆52 · Oct 21, 2023 · Updated 2 years ago
GATECH-EIC / Auto-NBA
[ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…
☆16 · Jan 3, 2022 · Updated 4 years ago
sony-si / ai-research
☆49 · Jul 28, 2020 · Updated 5 years ago
mrusci / training-mixed-precision-quantized-networks
This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…
☆51 · May 9, 2024 · Updated 2 years ago
nbasyl / OFQ
The official implementation of the ICML 2023 paper OFQ-ViT
☆39 · Oct 3, 2023 · Updated 2 years ago
lottery-ticket / code
☆13 · Mar 8, 2020 · Updated 6 years ago
submission2019 / cnn-quantization
Quantization of Convolutional Neural Networks.
☆250 · Aug 5, 2024 · Updated last year
cornell-zhang / dnn-quant-ocs
DNN quantization with outlier channel splitting (ICML'19)
☆114 · Mar 21, 2020 · Updated 6 years ago
ynahshan / nn-quantization-pytorch
☆59 · Dec 8, 2020 · Updated 5 years ago
houlu369 / Loss-aware-weight-quantization
Implementation of ICLR 2018 paper "Loss-aware Weight Quantization of Deep Networks"
☆27 · Oct 24, 2019 · Updated 6 years ago
csyhhu / MetaQuant
Code for the accepted paper "MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization" at NeurIPS 2019
☆54 · May 8, 2020 · Updated 6 years ago
ziplab / QLLM
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆31 · Mar 12, 2024 · Updated 2 years ago
caiwenpu / Compression_Paper
☆46 · Sep 5, 2019 · Updated 6 years ago
LaVieEnRoseSMZ / AutoBNN
☆45 · Jan 17, 2020 · Updated 6 years ago
zhaoweicai / EdMIPS
PyTorch implementation of EdMIPS: https://arxiv.org/pdf/2004.05795.pdf
☆61 · Jul 27, 2020 · Updated 5 years ago
penhunt / full-quantization-DNN
PyTorch code for full quantization of DNN using BCGD
☆14 · Jul 24, 2019 · Updated 6 years ago
itayhubara / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆97 · Jun 10, 2021 · Updated 5 years ago
yhhhli / APoT_Quantization
PyTorch implementation for the APoT quantization (ICLR 2020)
☆288 · Dec 11, 2024 · Updated last year
allenbai01 / ProxQuant
ProxQuant: Quantized Neural Networks via Proximal Operators
☆30 · Feb 19, 2019 · Updated 7 years ago
XinDongol / DNNAC
All about acceleration and compression of Deep Neural Networks
☆33 · Nov 5, 2019 · Updated 6 years ago
HuangOwen / QAT-ACS
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆38 · Aug 20, 2024 · Updated last year
microsoft / LQ-Nets
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
☆245 · Aug 30, 2022 · Updated 3 years ago
zhutmost / lsq-net
Unofficial implementation of LSQ-Net, a neural network quantization framework
☆315 · May 8, 2024 · Updated 2 years ago
Adamdad / Samesame
A Tensorflow.keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio…
☆10 · Dec 18, 2019 · Updated 6 years ago
hustvl / PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆61 · Mar 23, 2023 · Updated 3 years ago
EunhyeokPark / PROFIT
☆49 · Jan 21, 2022 · Updated 4 years ago
Jangho-Kim / PSG-pytorch
Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)
☆27 · Nov 12, 2020 · Updated 5 years ago
ChaofanTao / FAT_Quantization
PyTorch implementation for FAT: learning low-bitwidth parametric representation via frequency-aware transformation
☆57 · May 2, 2021 · Updated 5 years ago
cvlab-yonsei / EWGS
An official implementation of "Network Quantization with Element-wise Gradient Scaling" (CVPR 2021) in PyTorch.
☆97 · Jul 14, 2023 · Updated 3 years ago
jack-willturner / nas-as-program-transformation-exploration
The code for our paper "Neural Architecture Search as Program Transformation Exploration"
☆17 · Apr 28, 2021 · Updated 5 years ago
SHI-Labs / Any-Precision-DNNs
Any-Precision Deep Neural Networks (AAAI 2021)
☆62 · May 2, 2020 · Updated 6 years ago
bohanzhuang / Towards-Effective-Low-bitwidth-Convolutional-Neural-Networks
This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"
☆20 · Aug 30, 2021 · Updated 4 years ago
yukang2017 / NAS-quantization
The code for Joint Neural Architecture Search and Quantization
☆14 · Apr 10, 2019 · Updated 7 years ago
peiswang / Two-Step-Quantization-AlexNet
Two-Step Quantization on AlexNet
☆13 · Jun 29, 2018 · Updated 8 years ago
submission2019 / AnalyticalScaleForIntegerQuantization
Example of applying Gaussian and Laplace clipping to CNN activations.
☆34 · Jan 20, 2019 · Updated 7 years ago
kssteven418 / I-BERT
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
☆269 · Jan 29, 2023 · Updated 3 years ago
elliothe / Ternarized_Neural_Network
Optimizing Deep Convolutional Neural Network with Ternarized Weights and High Accuracy
☆16 · Jan 27, 2019 · Updated 7 years ago
Intelligent-Computing-Lab-Panda / TesseraQ
☆25 · Oct 31, 2024 · Updated last year
1adrianb / binary-nas
☆35 · Mar 4, 2020 · Updated 6 years ago