whyNLP / LCKV
Layer-Condensed KV cache with 10 times larger batch size, fewer parameters and less computation. Dramatic speedup with better task performance. Accepted to ACL 2024.
156 · Apr 7, 2025 · Updated 10 months ago

Alternatives and similar repositories for LCKV

Users that are interested in LCKV are comparing it to the libraries listed below
