jiahansu / GPUAR
A CUDA implementation of Arithmetic Coding
☆15Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for GPUAR
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆95Updated 2 years ago
- ☆11Updated 2 years ago
- Code for ICML 2021 submission☆35Updated 3 years ago
- pytorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"☆129Updated 4 years ago
- Pytorch implementation of RAPQ, IJCAI 2022☆21Updated last year
- Any-Precision Deep Neural Networks (AAAI 2021)☆56Updated 4 years ago
- Post-training sparsity-aware quantization☆33Updated last year
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)☆27Updated last year
- An official implement of CVPR 2023 paper - NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers☆16Updated 8 months ago
- A PyTorch Framework for Efficient Pruning and Quantization for specialized accelerators.☆32Updated 2 years ago
- code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"☆26Updated 4 years ago
- ☆42Updated 9 months ago
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆49Updated last year
- YAECL: Yet Another Entropy Coding Library for Neural Compression Research, with Arithmetic Coding and Asymmetric Numeral Systems support☆34Updated last year
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆95Updated 3 years ago
- ☆68Updated 2 years ago
- ☆17Updated 2 years ago
- Code for paper "Automatic Neural Network Compression by Sparsity-Quantization Joint Learning: A Constrained Optimization-based Approach"☆18Updated 4 years ago
- [NeurIPS 2020] ShiftAddNet: A Hardware-Inspired Deep Network☆69Updated 4 years ago
- Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv☆77Updated 2 years ago
- LSQ+ or LSQplus☆59Updated last year
- ☆15Updated 2 years ago
- ☆12Updated 2 years ago
- Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks☆67Updated 3 years ago
- This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…☆24Updated 3 years ago
- ☆12Updated 2 years ago
- Data compression in JAX☆57Updated this week
- PyTorch implementation of Towards Efficient Training for Neural Network Quantization☆15Updated 4 years ago
- ☆24Updated 2 years ago
- Official code repo for NeurIPS 2020 paper "Improving Inference for Neural Image Compression"☆47Updated last year