whyNLP / LCKVView on GitHub
Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance. Accepted to ACL 2024.
157Apr 7, 2025Updated last year

Alternatives and similar repositories for LCKV

Users that are interested in LCKV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?