mit-han-lab / spatten

[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
82 · Updated 5 months ago

Alternatives and similar repositories for spatten:

Users who are interested in spatten are comparing it to the libraries listed below.