BradMcDanel / sdgp
☆10Updated 3 years ago
Alternatives and similar repositories for sdgp:
Users that are interested in sdgp are comparing it to the libraries listed below
- Code for ICML 2021 submission☆34Updated 4 years ago
- [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…☆30Updated last year
- ☆43Updated last year
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆32Updated 2 years ago
- code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"☆28Updated 4 years ago
- Post-training sparsity-aware quantization☆34Updated 2 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Updated 2 years ago
- ☆36Updated 5 months ago
- Code for ICML 2022 paper "SPDY: Accurate Pruning with Speedup Guarantees"☆18Updated 2 years ago
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…☆47Updated 2 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆94Updated 3 years ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆46Updated last year
- Memory Optimizations for Deep Learning (ICML 2023)☆64Updated last year
- ☆76Updated 2 years ago
- PyTorch implementation of Towards Efficient Training for Neural Network Quantization☆15Updated 5 years ago
- A collection of research papers on efficient training of DNNs☆70Updated 2 years ago
- MLPruning, PyTorch, NLP, BERT, Structured Pruning☆20Updated 3 years ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆50Updated 4 years ago
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆49Updated last year
- [TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"☆30Updated 8 months ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆11Updated 2 years ago
- ☆40Updated 9 months ago
- ☆68Updated 3 months ago
- ☆29Updated last year
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆52Updated 2 years ago
- ☆55Updated last year
- ☆42Updated 2 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆35Updated last year
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆109Updated 5 months ago
- Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper.☆27Updated 3 years ago