tonyzhao-jt / LLM-PQ

Official Repo for "LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization"
28 · Updated last year

Alternatives and similar repositories for LLM-PQ:

Users who are interested in LLM-PQ are comparing it to the libraries listed below.