A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/Pallas/JAX).
☆34Mar 4, 2025Updated last year
Alternatives and similar repositories for jax-flash-attn2
Users that are interested in jax-flash-attn2 are comparing it to the libraries listed below
Sorting:
- A cutting-edge text-to-image generator model that leverages state-of-the-art Stable Diffusion Model Type to produce high-quality, realist…☆13Mar 4, 2024Updated 2 years ago
- (EasyDel Former) is a utility library designed to simplify and enhance the development in JAX☆29Feb 21, 2026Updated 2 weeks ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆339Updated this week
- OST Collection: An AI-powered suite of models that predict the next word matches with remarkable accuracy (Text Generative Models). OST C…☆16Nov 16, 2023Updated 2 years ago
- Agents for intelligence and coordination☆20Jan 4, 2026Updated 2 months ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- If it quacks like a tensor...☆60Nov 13, 2024Updated last year
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.☆158Nov 11, 2025Updated 3 months ago
- Pytorch/XLA SPMD Test code in Google TPU☆23Apr 3, 2024Updated last year
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- ☆29Feb 27, 2024Updated 2 years ago
- Inference code for LLaMA models in JAX☆120May 21, 2024Updated last year
- Jax like function transformation engine but micro, microjax☆34Oct 25, 2024Updated last year
- Transformers components but in Triton☆34May 9, 2025Updated 10 months ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆41Jun 22, 2024Updated last year
- ☆44Nov 1, 2025Updated 4 months ago
- Python API for the EAGLE cosmological simulation database.☆12Mar 15, 2023Updated 2 years ago
- Pseudopotential converter from upf to psp8☆11Jan 25, 2023Updated 3 years ago
- Berkeley CS285 2019 homework solution☆31Mar 24, 2023Updated 2 years ago
- LM engine is a library for pretraining/finetuning LLMs☆126Updated this week
- Framework to reduce autotune overhead to zero for well known deployments.☆97Sep 19, 2025Updated 5 months ago
- ☆20May 24, 2025Updated 9 months ago
- Minimal JAX implementation of k-nearest neighbors using a k-d tree.☆55Jul 15, 2025Updated 7 months ago
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- ☆93Jul 5, 2024Updated last year
- ☆14May 14, 2019Updated 6 years ago
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆98Dec 5, 2024Updated last year
- ☆42Sep 20, 2022Updated 3 years ago
- ☆40Jan 5, 2024Updated 2 years ago
- Make triton easier☆50Jun 12, 2024Updated last year
- SO Likelihoods and Theories☆15Feb 11, 2026Updated 3 weeks ago
- 更纯粹、更高压缩率的Tokenizer in Rust☆13Dec 21, 2024Updated last year
- A scalable benchmark for state representation learning in visual reinforcement learning.☆16Jun 23, 2025Updated 8 months ago
- ☆13Jun 22, 2025Updated 8 months ago
- ☆14Mar 8, 2025Updated last year
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago