lucidrains / native-sparse-attention-pytorchLinks

Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
β˜†653Updated last week

Alternatives and similar repositories for native-sparse-attention-pytorch

Users that are interested in native-sparse-attention-pytorch are comparing it to the libraries listed below

Sorting: