A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/Pallas/JAX).
☆34 · Mar 4, 2025 · Updated last year
Alternatives and similar repositories for jax-flash-attn2
Users who are interested in jax-flash-attn2 are comparing it to the libraries listed below.
- A cutting-edge text-to-image generator model that leverages state-of-the-art Stable Diffusion Model Type to produce high-quality, realist… ☆13 · Mar 4, 2024 · Updated 2 years ago
- (EasyDel Former) is a utility library designed to simplify and enhance development in JAX ☆30 · Updated this week
- Xerxes, a highly advanced Persian AI assistant developed by InstinctAI, a cutting-edge AI startup. Its primary function is to assist users wi… ☆11 · Apr 27, 2024 · Updated last year
- OST Collection: An AI-powered suite of models that predict the next word matches with remarkable accuracy (Text Generative Models). OST C… ☆16 · Nov 16, 2023 · Updated 2 years ago
- Accelerate and optimize performance with streamlined training and serving options in JAX. ☆355 · Updated this week
- Agents for intelligence and coordination ☆20 · Updated this week
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆19 · Jul 24, 2025 · Updated 8 months ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Persistent dense gemm for Hopper in `CuTeDSL` ☆15 · Aug 9, 2025 · Updated 8 months ago
- If it quacks like a tensor... ☆60 · Nov 13, 2024 · Updated last year
- Minimal but scalable implementation of large language models in JAX ☆35 · Nov 28, 2025 · Updated 4 months ago
- Parallel Associative Scan for Language Models ☆18 · Jan 8, 2024 · Updated 2 years ago
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism. ☆162 · Nov 11, 2025 · Updated 5 months ago
- ☆40 · Dec 14, 2025 · Updated 4 months ago
- PyTorch/XLA SPMD test code on Google TPU ☆23 · Apr 3, 2024 · Updated 2 years ago
- Learn everything you need to know about any academic figure ☆16 · May 10, 2024 · Updated last year
- Inference code for LLaMA models in JAX ☆120 · May 21, 2024 · Updated last year
- Homework 3 for Berkeley CS 280: our version of the MIT Mini Places challenge ☆12 · Mar 5, 2016 · Updated 10 years ago
- Implementation of Computer Vision Models in JAX (equinox) ☆23 · Apr 4, 2026 · Updated 2 weeks ago
- Python API for the EAGLE cosmological simulation database. ☆12 · Mar 15, 2023 · Updated 3 years ago
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024) ☆13 · Feb 13, 2025 · Updated last year
- A JAX wrapper for cudaKDTree ☆11 · Sep 26, 2025 · Updated 6 months ago
- Example code snippet to visualize a neural network fitting a surface to random points in space ☆12 · Dec 26, 2021 · Updated 4 years ago
- Push Notification App for Flutter ☆13 · Aug 6, 2024 · Updated last year
- A repo based on XiLin Li's PSGD repo that extends some of the experiments. ☆14 · Oct 7, 2024 · Updated last year
- CIFAR10 ResNets implemented in JAX+Flax ☆12 · Apr 6, 2022 · Updated 4 years ago
- Benchmarking field-level cosmological inference from galaxy surveys. ☆13 · Jul 17, 2025 · Updated 9 months ago
- SO Likelihoods and Theories ☆16 · Updated this week
- KANs and MLPs ☆12 · Jun 7, 2024 · Updated last year
- Euclid Visible Instrument Python package. Includes a simulator and various analysis codes. ☆11 · Mar 25, 2015 · Updated 11 years ago
- ☆14 · Jun 22, 2025 · Updated 9 months ago
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100 ☆12 · Dec 30, 2023 · Updated 2 years ago
- imagetokenizer is a Python package that helps you encode visuals and generate visual token IDs from a codebook; supports both image and video… ☆40 · Jun 22, 2024 · Updated last year
- Lightweight and minimal DOM template and AJAX helpers ☆19 · Dec 15, 2023 · Updated 2 years ago
- NVIDIA SASS disassembler/inline patcher ☆67 · Updated this week
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024) ☆24 · Jun 6, 2024 · Updated last year
- Maximal Update Parametrization (μP) with Flax & Optax. ☆16 · Dec 27, 2023 · Updated 2 years ago
- ☆12 · May 30, 2025 · Updated 10 months ago
- JAX implementation of the GPTQ quantization algorithm ☆10 · Jul 19, 2023 · Updated 2 years ago