mit-han-lab / spatten

[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
75Updated 2 months ago

Related projects

Alternatives and complementary repositories for spatten