Implementation of Diffusion Transformer (DiT) in JAX
☆306 · Jun 11, 2024 · Updated last year
Alternatives and similar repositories for jax-diffusion-transformer
Users who are interested in jax-diffusion-transformer are comparing it to the libraries listed below
- Flow-matching algorithms in JAX ☆116 · Aug 12, 2024 · Updated last year
- UNet diffusion model in pure CUDA ☆657 · Jun 28, 2024 · Updated last year
- Simple Byte Pair Encoding mechanism used for tokenization, written purely in C ☆146 · Nov 11, 2024 · Updated last year
- Schedule-Free Optimization in PyTorch ☆2,257 · May 21, 2025 · Updated 9 months ago
- Official repository for our work on micro-budget training of large-scale diffusion models. ☆1,549 · Jan 12, 2025 · Updated last year
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach ☆635 · Jul 1, 2024 · Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆952 · Nov 16, 2025 · Updated 3 months ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax ☆159 · Nov 1, 2022 · Updated 3 years ago
- JAX implementation of ViT-VQGAN ☆63 · Jul 23, 2022 · Updated 3 years ago
- ☆292 · Jul 15, 2024 · Updated last year
- Minimal yet performant LLM examples in pure JAX ☆240 · Jan 14, 2026 · Updated last month
- Supporting PyTorch FSDP for optimizers ☆84 · Dec 8, 2024 · Updated last year
- PyTorch-like dataloaders for JAX. ☆101 · Dec 16, 2025 · Updated 2 months ago
- Distrax, but in Equinox. Lightweight JAX library of probability distributions and bijectors. ☆39 · Jan 16, 2026 · Updated last month
- ☆92 · Feb 16, 2026 · Updated 2 weeks ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆93 · Jan 25, 2024 · Updated 2 years ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆595 · Aug 12, 2025 · Updated 6 months ago
- ☆23 · Jun 18, 2024 · Updated last year
- LoRA for arbitrary JAX models and functions ☆145 · Feb 26, 2024 · Updated 2 years ago
- A simple library for scaling up JAX programs ☆146 · Nov 4, 2025 · Updated 4 months ago
- Tile primitives for speedy kernels ☆3,202 · Feb 24, 2026 · Updated last week
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,781 · Apr 18, 2025 · Updated 10 months ago
- A functional training loops library for JAX ☆88 · Feb 13, 2024 · Updated 2 years ago
- JAX-SPH: A Differentiable Smoothed Particle Hydrodynamics Framework ☆77 · Oct 29, 2025 · Updated 4 months ago
- Graph neural networks in JAX. ☆68 · Jun 18, 2024 · Updated last year
- Universal Notation for Tensor Operations in Python. ☆471 · Apr 8, 2025 · Updated 10 months ago
- Implementation for MatMul-free LM. ☆3,056 · Dec 2, 2025 · Updated 3 months ago
- seqax = sequence modeling + JAX ☆186 · Jul 23, 2025 · Updated 7 months ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure JAX! ☆261 · Oct 31, 2025 · Updated 4 months ago
- GPT-2 from scratch in MLX ☆417 · Jun 12, 2024 · Updated last year
- Minimalistic 4D-parallelism distributed training framework for educational purposes ☆2,099 · Aug 26, 2025 · Updated 6 months ago
- Clean single-file implementation of offline RL algorithms in JAX ☆170 · Nov 24, 2025 · Updated 3 months ago
- EDM2 and Autoguidance -- Official PyTorch implementation ☆824 · Dec 9, 2024 · Updated last year
- ☆16 · Jul 8, 2024 · Updated last year
- Alternative way of calculating self-attention ☆18 · May 25, 2024 · Updated last year
- Library for reading and processing ML training data. ☆685 · Feb 27, 2026 · Updated last week
- Unofficial JAX implementations of deep learning research papers ☆161 · Jun 25, 2022 · Updated 3 years ago
- jax-triton contains integrations between JAX and OpenAI Triton ☆439 · Updated this week
- GPT implementation in Flax ☆18 · Jan 8, 2022 · Updated 4 years ago