Implementation of Flash Attention in Jax
☆227Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for flash-attention-jax
Users that are interested in flash-attention-jax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆220Feb 13, 2023Updated 3 years ago
- jax-triton contains integrations between JAX and OpenAI Triton☆442Mar 26, 2026Updated 2 weeks ago
- LoRA for arbitrary JAX models and functions☆145Feb 26, 2024Updated 2 years ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆550Mar 17, 2026Updated 3 weeks ago
- JAX implementation of the Llama 2 model☆216Feb 2, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆367Apr 12, 2024Updated last year
- JMP is a Mixed Precision library for JAX.☆212Jan 30, 2025Updated last year
- Inference code for LLaMA models in JAX☆120May 21, 2024Updated last year
- ☆35Nov 22, 2024Updated last year
- JAX-Toolbox☆398Updated this week
- Train very large language models in Jax.☆210Oct 21, 2023Updated 2 years ago
- ☆351Updated this week
- ☆66Aug 2, 2022Updated 3 years ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆136Mar 17, 2026Updated 3 weeks ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- JAX Synergistic Memory Inspector☆186Jul 16, 2024Updated last year
- A simple, performant and scalable Jax LLM!☆2,201Updated this week
- Implementation of a Transformer, but completely in Triton☆279Apr 5, 2022Updated 4 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆190Jun 24, 2022Updated 3 years ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆122Oct 17, 2024Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆703Jan 26, 2026Updated 2 months ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax☆160Nov 1, 2022Updated 3 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆92Jun 18, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/☆2,840Mar 23, 2026Updated 2 weeks ago
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"☆70Apr 10, 2023Updated 2 years ago
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆32Jun 19, 2022Updated 3 years ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆350Apr 2, 2026Updated last week
- Minimal yet performant LLM examples in pure JAX☆246Jan 14, 2026Updated 2 months ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30May 31, 2022Updated 3 years ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆35Apr 25, 2024Updated last year
- Library for reading and processing ML training data.☆706Updated this week
- A simple library for scaling up JAX programs☆146Nov 4, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Machine Learning eXperiment Utilities☆48Jul 29, 2025Updated 8 months ago
- Minimal library to train LLMs on TPU in JAX with pjit().☆301Dec 20, 2023Updated 2 years ago
- Local Attention - Flax module for Jax☆22May 26, 2021Updated 4 years ago
- ☆931Mar 12, 2026Updated 3 weeks ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Jan 25, 2024Updated 2 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆242May 12, 2023Updated 2 years ago
- ☆19Oct 3, 2022Updated 3 years ago