snu-comparch / InfiniGen

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
106 · Updated 7 months ago

Alternatives and similar repositories for InfiniGen:

Users who are interested in InfiniGen are comparing it to the libraries listed below.