thu-ml / SpargeAttn

SpargeAttention: A training-free sparse attention that can accelerate any model inference.
385Updated 2 weeks ago

Alternatives and similar repositories for SpargeAttn:

Users that are interested in SpargeAttn are comparing it to the libraries listed below