GATECH-EIC / FracTrain
[NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Haoran You, Yang Zhao, Yue Wang, Chaojian Li, Kailash Gopalakrishnan, Zhangyang Wang, Yingyan Lin
☆11Updated 2 years ago
Related projects: ⓘ
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)☆26Updated 11 months ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)☆36Updated 3 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆14Updated 2 years ago
- Simulator for BitFusion☆85Updated 4 years ago
- [ICML 2021] "Double-Win Quant: Aggressively Winning Robustness of Quantized DeepNeural Networks via Random Precision Training and Inferen…☆12Updated 2 years ago
- ☆27Updated 4 years ago
- Post-training sparsity-aware quantization☆32Updated last year
- Conditional channel- and precision-pruning on neural networks☆71Updated 4 years ago
- ☆18Updated 2 years ago
- Code for ICML 2021 submission☆35Updated 3 years ago
- AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao.☆10Updated 2 years ago
- Any-Precision Deep Neural Networks (AAAI 2021)☆56Updated 4 years ago
- ☆19Updated 3 years ago
- Neural Network Quantization With Fractional Bit-widths☆12Updated 3 years ago
- [DAC 2024] EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive La…☆22Updated 2 months ago
- QuickEst repository: Quick Estimation of Quality of Results☆26Updated 5 years ago
- PyTorch implementation of EdMIPS: https://arxiv.org/pdf/2004.05795.pdf☆57Updated 4 years ago
- [ICLR 2021] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vi…☆30Updated 6 months ago
- ☆30Updated 3 years ago
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆49Updated last year
- mixed-precision quantization for LLMs☆12Updated 10 months ago
- This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"☆20Updated 3 years ago
- The code for Joint Neural Architecture Search and Quantization☆13Updated 5 years ago
- Torch-7 implementation of BinaryDuo (ICLR 2020).☆9Updated 3 years ago
- TBNv2: Convolutional Neural Network With Ternary Inputs and Binary Weights☆15Updated 4 years ago
- Official implementation for paper LIMPQ, "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance", ECCV 2022☆44Updated last year
- Codebase for ICML'24 paper: Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs☆21Updated 2 months ago
- ☆67Updated 2 years ago
- Reproduction of WAGE in PyTorch.☆41Updated 5 years ago
- Measuring and predicting on-device metrics (latency, power, etc.) of machine learning models☆66Updated last year