erfanzar / jax-flash-attn2

Flash Attention Implementation with Multiple Backend Support and Sharding. This module provides a flexible implementation of Flash Attention with support for different backends (GPU, TPU, CPU) and platforms (Triton, Pallas, JAX).
20 · Updated last month

Alternatives and similar repositories for jax-flash-attn2:

Users who are interested in jax-flash-attn2 are comparing it to the libraries listed below.