facebookresearch / chaiView on GitHub
CHAI is a library for dynamic pruning of attention heads for efficient LLM inference.
22Dec 11, 2024Updated last year

Alternatives and similar repositories for chai

Users that are interested in chai are comparing it to the libraries listed below

Sorting:

Are these results useful?