DD-DuDa / BitDecodingLinks

[HPCA 2025] A GPU-optimized system for efficient long-context LLMs decoding with low-bit KV cache.
62Updated this week

Alternatives and similar repositories for BitDecoding

Users that are interested in BitDecoding are comparing it to the libraries listed below

Sorting: