snu-comparch / InfiniGen
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
181 · Jul 10, 2024 · Updated last year

Alternatives and similar repositories for InfiniGen

Users who are interested in InfiniGen are comparing it to the libraries listed below.
