PiotrNawrot / nano-sparse-attention

The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.
37Updated this week

Related projects

Alternatives and complementary repositories for nano-sparse-attention