FMInference / H2O

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
420Updated 6 months ago

Alternatives and similar repositories for H2O:

Users that are interested in H2O are comparing it to the libraries listed below