Implementation of Flash Attention in Jax
☆228 · Mar 1, 2024 · Updated 2 years ago
Alternatives and similar repositories for flash-attention-jax
Users that are interested in flash-attention-jax are comparing it to the libraries listed below.
- Implementation of fused cosine similarity attention in the same style as Flash Attention · ☆220 · Feb 13, 2023 · Updated 3 years ago
- jax-triton contains integrations between JAX and OpenAI Triton · ☆447 · Updated this week
- LoRA for arbitrary JAX models and functions · ☆144 · Feb 26, 2024 · Updated 2 years ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta… · ☆549 · Apr 9, 2026 · Updated 2 weeks ago
- JAX implementation of the Llama 2 model · ☆216 · Feb 2, 2024 · Updated 2 years ago
- ☆367 · Apr 12, 2024 · Updated 2 years ago
- JMP is a Mixed Precision library for JAX. · ☆212 · Jan 30, 2025 · Updated last year
- Inference code for LLaMA models in JAX · ☆120 · May 21, 2024 · Updated last year
- ☆36 · Nov 22, 2024 · Updated last year
- JAX-Toolbox · ☆404 · Updated this week
- Train very large language models in Jax. · ☆209 · Oct 21, 2023 · Updated 2 years ago
- ☆353 · Apr 13, 2026 · Updated 2 weeks ago
- ☆66 · Aug 2, 2022 · Updated 3 years ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. · ☆135 · Apr 15, 2026 · Updated last week
- A simple, performant and scalable Jax LLM! · ☆2,255 · Updated this week
- Source-to-Source Debuggable Derivatives in Pure Python · ☆15 · Jan 23, 2024 · Updated 2 years ago
- JAX Synergistic Memory Inspector · ☆187 · Jul 16, 2024 · Updated last year
- Implementation of a Transformer, but completely in Triton · ☆279 · Apr 5, 2022 · Updated 4 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework) · ☆190 · Jun 24, 2022 · Updated 3 years ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts · ☆122 · Oct 17, 2024 · Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax · ☆704 · Jan 26, 2026 · Updated 3 months ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax · ☆160 · Nov 1, 2022 · Updated 3 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax · ☆92 · Jun 18, 2024 · Updated last year
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences" · ☆70 · Apr 10, 2023 · Updated 3 years ago
- Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/ · ☆2,858 · Apr 20, 2026 · Updated last week
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction · ☆32 · Jun 19, 2022 · Updated 3 years ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX. · ☆355 · Apr 22, 2026 · Updated last week
- Minimal yet performant LLM examples in pure JAX · ☆250 · Apr 10, 2026 · Updated 2 weeks ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models · ☆30 · May 31, 2022 · Updated 3 years ago
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024) · ☆36 · Apr 25, 2024 · Updated 2 years ago
- Library for reading and processing ML training data. · ☆717 · Updated this week
- A simple library for scaling up JAX programs · ☆146 · Nov 4, 2025 · Updated 5 months ago
- Machine Learning eXperiment Utilities · ☆48 · Jul 29, 2025 · Updated 9 months ago
- Minimal library to train LLMs on TPU in JAX with pjit(). · ☆299 · Dec 20, 2023 · Updated 2 years ago
- Local Attention - Flax module for Jax · ☆22 · May 26, 2021 · Updated 4 years ago
- ☆936 · Apr 10, 2026 · Updated 2 weeks ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes · ☆241 · May 12, 2023 · Updated 2 years ago
- ☆19 · Oct 3, 2022 · Updated 3 years ago
- Implementation of Diffusion Transformers and Rectified Flow in Jax · ☆27 · Jul 9, 2024 · Updated last year