☆43Jan 30, 2024Updated 2 years ago
Alternatives and similar repositories for HAP
Users that are interested in HAP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition☆34Oct 11, 2021Updated 4 years ago
- Open Source Projects from Pallas Lab☆21Oct 10, 2021Updated 4 years ago
- code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"☆29Oct 31, 2020Updated 5 years ago
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi…☆28Feb 7, 2023Updated 3 years ago
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆54Dec 1, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆460May 15, 2023Updated 2 years ago
- Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.☆138Apr 28, 2022Updated 3 years ago
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆58Feb 7, 2023Updated 3 years ago
- MLPruning, PyTorch, NLP, BERT, Structured Pruning☆20Jun 29, 2021Updated 4 years ago
- ☆19Mar 17, 2021Updated 5 years ago
- A 8-/16-/32-/64-bit floating point number family☆16Feb 4, 2022Updated 4 years ago
- ☆13Nov 29, 2021Updated 4 years ago
- Example for applying Gaussian and Laplace clipping on activations of CNN.☆34Jan 20, 2019Updated 7 years ago
- PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks☆779Jul 10, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆21Feb 5, 2024Updated 2 years ago
- Any-Precision Deep Neural Networks (AAAI 2021)☆62May 2, 2020Updated 5 years ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆282Dec 8, 2023Updated 2 years ago
- [KDD'22] Learned Token Pruning for Transformers☆98Feb 27, 2023Updated 3 years ago
- [ICML'21 Oral] I-BERT: Integer-only BERT Quantization☆267Jan 29, 2023Updated 3 years ago
- Jax implementation of the AdaHessian optimizer☆20Mar 11, 2021Updated 5 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Nov 12, 2020Updated 5 years ago
- [CVPR'20] ZeroQ Mixed-Precision implementation (unofficial): A Novel Zero Shot Quantization Framework☆14Dec 16, 2020Updated 5 years ago
- BitSplit Post-trining Quantization☆50Dec 20, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- DNN quantization with outlier channel splitting (ICML'19)☆114Mar 21, 2020Updated 6 years ago
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆36Feb 10, 2026Updated 2 months ago
- DropNet: Reducing Neural Network Complexity via Iterative Pruning (ICML 2020)☆16Aug 24, 2020Updated 5 years ago
- All about acceleration and compression of Deep Neural Networks☆33Nov 5, 2019Updated 6 years ago
- ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training☆198Dec 22, 2022Updated 3 years ago
- PyTorch-Implementation of "Data-Driven Sparse Structure Selection for Deep Neural Networks"☆21Apr 17, 2020Updated 5 years ago
- ☆15Oct 26, 2022Updated 3 years ago
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference☆58Nov 20, 2024Updated last year
- Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)☆73Oct 7, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch)☆34May 21, 2023Updated 2 years ago
- pyhessian is a TensorFlow module which can be used to estimate Hessian matrices☆25Mar 26, 2021Updated 5 years ago
- Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".☆130Jul 11, 2023Updated 2 years ago
- Code to implement the experiments in "Post-training Quantization for Neural Networks with Provable Guarantees" by Jinjie Zhang, Yixuan Zh…☆11Jun 2, 2023Updated 2 years ago
- A research library for pytorch-based neural network pruning, compression, and more.☆163Nov 28, 2022Updated 3 years ago
- [ACL 2025 Long Main] Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public Opinions☆40Apr 21, 2025Updated 11 months ago
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning☆285Feb 27, 2023Updated 3 years ago