AnswerDotAI / cold-compress

Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of GPT-Fast, a simple, PyTorch-native generation codebase.
106Updated 5 months ago

Alternatives and similar repositories for cold-compress:

Users that are interested in cold-compress are comparing it to the libraries listed below