lucidrains / native-sparse-attention-pytorchLinks

Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
β˜†637Updated 2 weeks ago

Alternatives and similar repositories for native-sparse-attention-pytorch

Users that are interested in native-sparse-attention-pytorch are comparing it to the libraries listed below

Sorting: