lucidrains / native-sparse-attention-pytorch

Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
β˜†561Updated this week

Alternatives and similar repositories for native-sparse-attention-pytorch:

Users that are interested in native-sparse-attention-pytorch are comparing it to the libraries listed below