tonyzhao-jt / LLM-PQ

Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and Adaptive Quantization"
36 · Aug 29, 2025 · Updated 5 months ago

Alternatives and similar repositories for LLM-PQ

Users who are interested in LLM-PQ are comparing it to the libraries listed below.
