tonyzhao-jt / LLM-PQ
Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and Adaptive Quantization"
37 · Aug 29, 2025 · Updated 7 months ago

Alternatives and similar repositories for LLM-PQ

Users that are interested in LLM-PQ are comparing it to the libraries listed below.
