lucidrains / flash-attention-jaxLinks

Implementation of Flash Attention in Jax

☆222

Alternatives and similar repositories for flash-attention-jax

Users that are interested in flash-attention-jax are comparing it to the libraries listed below

Sorting:

lucidrains / triton-transformer
Implementation of a Transformer, but completely in Triton
☆277Updated 3 years ago
jax-ml / jax-triton
jax-triton contains integrations between JAX and OpenAI Triton
☆436Updated last week
google-deepmind / jmp
JMP is a Mixed Precision library for JAX.
☆211Updated 10 months ago
google / praxis
☆190Updated 2 weeks ago
ayaka14732 / jax-smi
JAX Synergistic Memory Inspector
☆182Updated last year
davisyoshida / lorax
LoRA for arbitrary JAX models and functions
☆143Updated last year
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆216Updated last year
young-geng / scalax
A simple library for scaling up JAX programs
☆144Updated 3 weeks ago
lucidrains / PaLM-jax
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)
☆189Updated 3 years ago
google / flaxformer
☆363Updated last year
graphcore-research / unit-scaling
A library for unit scaling in PyTorch
☆132Updated 4 months ago
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆210Updated 2 years ago
nshepperd / flash_attn_jax
JAX bindings for Flash Attention v2
☆99Updated last month
proger / accelerated-scan
Accelerated First Order Parallel Associative Scan
☆192Updated last year
NVIDIA / JAX-Toolbox
JAX-Toolbox
☆364Updated this week
google-research / jaxpruner
☆234Updated 9 months ago
sholtodouglas / scalingExperiments
☆62Updated 3 years ago
srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆80Updated last year
google / aqt
☆337Updated last week
AminRezaei0x443 / memory-efficient-attention
Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch
☆184Updated 2 years ago
jax-ml / jax-llm-examples
Minimal yet performant LLM examples in pure JAX
☆204Updated 2 months ago
young-geng / mlxu
Machine Learning eXperiment Utilities
☆46Updated 4 months ago
Sea-Snell / JAX_llama
Inference code for LLaMA models in JAX
☆120Updated last year
cloneofsimo / min-fsdp
☆91Updated last year
google / jaxonnxruntime
A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.
☆125Updated 2 months ago
openxla / tokamax
Tokamax: A GPU and TPU kernel library.
☆122Updated this week
mgmalek / efficient_cross_entropy
☆121Updated last year
davisyoshida / qax
If it quacks like a tensor...
☆59Updated last year
meta-pytorch / float8_experimental
This repository contains the experimental PyTorch native float8 training UX
☆226Updated last year
MatX-inc / seqax
seqax = sequence modeling + JAX
☆168Updated 4 months ago