facebookresearch / chaiLinks

CHAI is a library for dynamic pruning of attention heads for efficient LLM inference.
22Updated last year

Alternatives and similar repositories for chai

Users that are interested in chai are comparing it to the libraries listed below

Sorting: