whyNLP / LCKV

Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance. Accepted to ACL 2024.
147Updated last week

Alternatives and similar repositories for LCKV:

Users that are interested in LCKV are comparing it to the libraries listed below