Implementation of Flash Attention in Jax
☆228Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for flash-attention-jax
Users that are interested in flash-attention-jax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆220Feb 13, 2023Updated 3 years ago
- jax-triton contains integrations between JAX and OpenAI Triton☆464Jun 1, 2026Updated 3 weeks ago
- LoRA for arbitrary JAX models and functions☆144Feb 26, 2024Updated 2 years ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆556Jun 4, 2026Updated 3 weeks ago
- JAX implementation of the Llama 2 model☆217Feb 2, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆370Apr 12, 2024Updated 2 years ago
- JMP is a Mixed Precision library for JAX.☆214Jan 30, 2025Updated last year
- Inference code for LLaMA models in JAX☆120May 21, 2024Updated 2 years ago
- ☆36Nov 22, 2024Updated last year
- JAX-Toolbox☆415Jun 22, 2026Updated last week
- Train very large language models in Jax.☆208Oct 21, 2023Updated 2 years ago
- ☆357Apr 13, 2026Updated 2 months ago
- ☆66Aug 2, 2022Updated 3 years ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆135Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A simple, performant and scalable Jax LLM!☆2,338Updated this week
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- JAX Synergistic Memory Inspector☆186Jul 16, 2024Updated last year
- Implementation of a Transformer, but completely in Triton☆279Apr 5, 2022Updated 4 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆189Jun 24, 2022Updated 4 years ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆122Oct 17, 2024Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆706Jan 26, 2026Updated 5 months ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax☆161Nov 1, 2022Updated 3 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆92Jun 18, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"☆71Apr 10, 2023Updated 3 years ago
- Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/☆2,911Jun 13, 2026Updated 2 weeks ago
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆32Jun 19, 2022Updated 4 years ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆367Updated this week
- Minimal yet performant LLM examples in pure JAX☆261Apr 10, 2026Updated 2 months ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30May 31, 2022Updated 4 years ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆36Apr 25, 2024Updated 2 years ago
- Library for reading and processing ML training data.☆747Updated this week
- A simple library for scaling up JAX programs☆148Nov 4, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Machine Learning eXperiment Utilities☆48Jul 29, 2025Updated 11 months ago
- Minimal library to train LLMs on TPU in JAX with pjit().☆299Jun 2, 2026Updated 3 weeks ago
- Local Attention - Flax module for Jax☆22May 26, 2021Updated 5 years ago
- ☆946Jun 12, 2026Updated 2 weeks ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆241May 12, 2023Updated 3 years ago
- ☆19Oct 3, 2022Updated 3 years ago
- Implementation of Diffusion Transformers and Rectified Flow in Jax☆27Jul 9, 2024Updated last year