facebookresearch / chaiLinks

CHAI is a library for dynamic pruning of attention heads for efficient LLM inference.
16Updated 5 months ago

Alternatives and similar repositories for chai

Users that are interested in chai are comparing it to the libraries listed below

Sorting: