snu-comparch / InfiniGen

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
79Updated 4 months ago

Related projects

Alternatives and complementary repositories for InfiniGen