thu-ml / SpargeAttn

SpargeAttention: A training-free sparse attention method that can accelerate inference for any model.
453 · Updated this week

Alternatives and similar repositories for SpargeAttn:

Users who are interested in SpargeAttn are comparing it to the libraries listed below.