mit-han-lab / spatten

[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
81Updated 4 months ago

Alternatives and similar repositories for spatten:

Users that are interested in spatten are comparing it to the libraries listed below