tonyzhao-jt / LLM-PQ

Official Repo for "LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization"
27Updated 8 months ago

Related projects

Alternatives and complementary repositories for LLM-PQ