Qualcomm-AI-research/pruning-vs-quantization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Qualcomm-AI-research/pruning-vs-quantization)

Qualcomm-AI-research / pruning-vs-quantization

☆26

Alternatives and similar repositories for pruning-vs-quantization

Users that are interested in pruning-vs-quantization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

deJQK / FracBits
View on GitHub
Neural Network Quantization With Fractional Bit-widths
☆11Feb 19, 2021Updated 5 years ago
TanayNarshana / DFPC-Pruning
View on GitHub
[ICLR 2023] PyTorch code for DFPC: Data flow driven pruning of coupled channels without data.
☆15Aug 25, 2023Updated 2 years ago
hustvl / PD-Quant
View on GitHub
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆61Mar 23, 2023Updated 3 years ago
Qualcomm-AI-research / outlier-free-transformers
View on GitHub
☆46Dec 20, 2023Updated 2 years ago
IMPETUS-UdeS / rule4ml
View on GitHub
Resource Utilization and Latency Estimation for ML on FPGA.
☆20Apr 11, 2026Updated 3 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Qualcomm-AI-research / BayesianBits
View on GitHub
☆22Feb 11, 2022Updated 4 years ago
pprp / STBLLM
View on GitHub
[ICLR25] STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs
☆20Jun 3, 2025Updated last year
Qualcomm-AI-research / oscillations-qat
View on GitHub
☆81Jul 21, 2022Updated 4 years ago
Qualcomm-AI-research / gptvq
View on GitHub
☆42Mar 28, 2024Updated 2 years ago
Qualcomm-AI-research / lr-qat
View on GitHub
☆54Nov 5, 2024Updated last year
inEXASCALE / pychop
View on GitHub
A Python package for simulating low precision arithmetic in scientific computing and machine learning
☆21Jun 7, 2026Updated last month
insuhan / calibquant
View on GitHub
☆21Apr 3, 2025Updated last year
facebookresearch / SecureFLCompression
View on GitHub
Compression primitives for uplink compression in Federated Learning that are compatible with Secure Aggregation.
☆11Jul 27, 2022Updated 3 years ago
1157942086 / CVPR2020_Auxiliary_Quantization
View on GitHub
Training Quantized Neural Networks with a Full-precision Auxiliary Module
☆13Jun 19, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
bytedance / MRECG
View on GitHub
☆36Mar 29, 2023Updated 3 years ago
L3030 / FedCyBGD
View on GitHub
The implement of FedCyBGD
☆12Jul 19, 2024Updated 2 years ago
ModelTC / L2_Compression
View on GitHub
☆13Jun 16, 2024Updated 2 years ago
mlzxy / qsparse
View on GitHub
Train neural networks with joint quantization and pruning on both weights and activations using any pytorch modules
☆42Sep 19, 2022Updated 3 years ago
EricLoong / feddip
View on GitHub
The official code for ICDM2023 paper: ' FedDIP: Federated Learning with Extreme Dynamic Pruning and Incremental Regularization'
☆14Aug 16, 2024Updated last year
BrotherHappy / OSTQuant
View on GitHub
[ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitt…
☆93Apr 8, 2025Updated last year
Qualcomm-AI-research / llm-surgeon
View on GitHub
☆35May 24, 2024Updated 2 years ago
HuangOwen / QAT-ACS
View on GitHub
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆38Aug 20, 2024Updated last year
papers-submission / CalibTIP
View on GitHub
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆36Jun 29, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Qualcomm-AI-research / transformer-quantization
View on GitHub
☆211Nov 9, 2021Updated 4 years ago
lihuantong / HAST
View on GitHub
☆14Oct 6, 2023Updated 2 years ago
casszhao / PruneHall
View on GitHub
Codebase, data and models for hallucination of pruned models
☆16Jan 11, 2025Updated last year
calad0i / HGQ
View on GitHub
Legacy High Granularity Quantizarion 1 - Please use HGQ2 instead (https://github.com/calad0i/HGQ)
☆40Mar 13, 2026Updated 4 months ago
news-vt / Green-Quantized-FL-over-Wireless-Networks-An-Energy-Efficient-Design
View on GitHub
This is a repository for the implementation of the paper "Green, Quantized Federated Learning over Wireless Networks: An Energy-Efficient…
☆14Jul 1, 2023Updated 3 years ago
fangvv / FL-PQSU
View on GitHub
Code for paper "Accelerating Federated Learning for IoT in Big Data Analytics with Pruning, Quantization and Selective Updating"
☆13Jun 17, 2026Updated last month
GATECH-EIC / ShiftAddNet
View on GitHub
[NeurIPS 2020] ShiftAddNet: A Hardware-Inspired Deep Network
☆74Nov 16, 2020Updated 5 years ago
WeixiangXu / STTN
View on GitHub
☆17Oct 25, 2022Updated 3 years ago
zysxmu / FDDA
View on GitHub
Pytorch implementation of our paper accepted by ECCV 2022-- Fine-grained Data Distribution Alignment for Post-Training Quantization
☆16Sep 13, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
glory20h / FitHuBERT
View on GitHub
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)
☆19Nov 15, 2023Updated 2 years ago
ModelTC / AAAI2023_EAMPD
View on GitHub
AAAI2023 Efficient and Accurate Models towards Practical Deep Learning Baseline
☆13Nov 29, 2022Updated 3 years ago
zejiangh / Filter-GaP
View on GitHub
The official PyTorch implementation of CHEX: CHannel EXploration for CNN Model Compression (CVPR 2022). Paper is available at https://ope…
☆37Jul 2, 2022Updated 4 years ago
xvyaward / owq
View on GitHub
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…
☆72Mar 7, 2024Updated 2 years ago
lizhuangzi / IGAN
View on GitHub
Code of ACM MM 2021 paper: Information-Growth Attention Network for Image Super-Resolution
☆21Dec 1, 2021Updated 4 years ago
YINYIPENG-EN / Pruning_for_yolov4
View on GitHub
对yolov4进行通道剪枝
☆15Jun 20, 2022Updated 4 years ago
hfutqian / AdaDFQ
View on GitHub
☆22Oct 27, 2024Updated last year