Vahe1994 / SpQRLinks
☆547Updated 10 months ago
Alternatives and similar repositories for SpQR
Users that are interested in SpQR are comparing it to the libraries listed below
Sorting:
- [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization☆705Updated last year
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"☆385Updated last year
- [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.☆867Updated 5 months ago
- ☆564Updated last year
- Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".☆846Updated last year
- Official PyTorch implementation of QA-LoRA☆143Updated last year
- GPTQ inference Triton kernel☆313Updated 2 years ago
- For releasing code related to compression methods for transformers, accompanying our publications☆447Updated 9 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers☆426Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining