Implementation of Flash Attention in Jax
☆228Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for flash-attention-jax
Users that are interested in flash-attention-jax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆220Feb 13, 2023Updated 3 years ago
- jax-triton contains integrations between JAX and OpenAI Triton☆458Apr 23, 2026Updated 3 weeks ago
- LoRA for arbitrary JAX models and functions☆144Feb 26, 2024Updated 2 years ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆551May 8, 2026Updated last week
- JAX implementation of the Llama 2 model☆216Feb 2, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆367Apr 12, 2024Updated 2 years ago
- JMP is a Mixed Precision library for JAX.☆213Jan 30, 2025Updated last year
- Inference code for LLaMA models in JAX☆120May 21, 2024Updated last year
- ☆36Nov 22, 2024Updated last year
- JAX-Toolbox☆407Updated this week
- Train very large language models in Jax.☆208Oct 21, 2023Updated 2 years ago
- ☆355Apr 13, 2026Updated last month
- ☆66Aug 2, 2022Updated 3 years ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆135Apr 15, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A simple, performant and scalable Jax LLM!☆2,280Updated this week
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- JAX Synergistic Memory Inspector☆186Jul 16, 2024Updated last year
- Implementation of a Transformer, but completely in Triton☆278Apr 5, 2022Updated 4 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆190Jun 24, 2022Updated 3 years ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆122Oct 17, 2024Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆705Jan 26, 2026Updated 3 months ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax☆161Nov 1, 2022Updated 3 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆92Jun 18, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"☆71Apr 10, 2023Updated 3 years ago
- Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/☆2,881May 11, 2026Updated last week
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆32Jun 19, 2022Updated 3 years ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆361Updated this week
- Minimal yet performant LLM examples in pure JAX☆255Apr 10, 2026Updated last month
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30May 31, 2022Updated 3 years ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆36Apr 25, 2024Updated 2 years ago
- Library for reading and processing ML training data.☆728Updated this week
- A simple library for scaling up JAX programs☆146Nov 4, 2025Updated 6 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Machine Learning eXperiment Utilities☆48Jul 29, 2025Updated 9 months ago
- Minimal library to train LLMs on TPU in JAX with pjit().☆299Dec 20, 2023Updated 2 years ago
- Local Attention - Flax module for Jax☆22May 26, 2021Updated 4 years ago
- ☆940Apr 10, 2026Updated last month
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆241May 12, 2023Updated 3 years ago
- ☆19Oct 3, 2022Updated 3 years ago
- Implementation of Diffusion Transformers and Rectified Flow in Jax☆27Jul 9, 2024Updated last year