This repo contains the code for studying the interplay between quantization and sparsity methods
☆26 · Feb 26, 2025 · Updated last year
Alternatives and similar repositories for quantization-sparsity-interplay
Users that are interested in quantization-sparsity-interplay are comparing it to the libraries listed below.
- ☆41 · Nov 22, 2025 · Updated 6 months ago
- ☆21 · Oct 2, 2024 · Updated last year
- Implementation of Input Stationary, Weight Stationary, and Output Stationary dataflows for a given neural network on a tiled architecture · ☆10 · Apr 19, 2020 · Updated 6 years ago
- ☆19 · Nov 11, 2024 · Updated last year
- The official implementation of the DAC 2024 paper GQA-LUT · ☆23 · Dec 20, 2024 · Updated last year
- A bit-level sparsity-aware multiply-accumulate processing element. · ☆19 · Jul 9, 2024 · Updated last year
- [NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models