SNU-ARC / any-precision-llm
[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs
☆123 Updated 6 months ago
Alternatives and similar repositories for any-precision-llm
Users interested in any-precision-llm are comparing it to the libraries listed below.
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model… ☆68 Updated last year
- ☆60 Updated last year
- LLM Inference with Microscaling Format