snu-comparch / InfiniGen

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
112 · Updated 8 months ago

Alternatives and similar repositories for InfiniGen:

Users who are interested in InfiniGen are comparing it to the libraries listed below.