vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
661 · Updated this week

Related projects

Alternatives and complementary repositories for llm-compressor