PiotrNawrot / nano-sparse-attention

The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.
55Updated last month

Alternatives and similar repositories for nano-sparse-attention:

Users that are interested in nano-sparse-attention are comparing it to the libraries listed below