opengear-project / GEAR

GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
157Updated 7 months ago

Alternatives and similar repositories for GEAR:

Users that are interested in GEAR are comparing it to the libraries listed below