softmax1 / Flash-Attention-Softmax-N

CUDA and Triton implementations of Flash Attention with SoftmaxN.
67Updated 8 months ago

Alternatives and similar repositories for Flash-Attention-Softmax-N:

Users that are interested in Flash-Attention-Softmax-N are comparing it to the libraries listed below