FMInference / H2O

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
391Updated 3 months ago

Related projects

Alternatives and complementary repositories for H2O