zyxxmu / cam

PyTorch implementation of our ICML 2024 paper, CaM: Cache Merging for Memory-efficient LLMs Inference
27 · Updated 5 months ago

Related projects

Alternatives and complementary repositories for cam