zyxxmu / cam

Pytorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference
31Updated 7 months ago

Alternatives and similar repositories for cam:

Users that are interested in cam are comparing it to the libraries listed below