mit-han-lab / spattenLinks

[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
119Updated last year

Alternatives and similar repositories for spatten

Users that are interested in spatten are comparing it to the libraries listed below

Sorting: