opengear-project / GEAR

GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
147Updated 4 months ago

Related projects

Alternatives and complementary repositories for GEAR