brianfitzgerald / jax-mmditLinks
Implementation of Diffusion Transformers and Rectified Flow in Jax
☆25Updated last year
Alternatives and similar repositories for jax-mmdit
Users that are interested in jax-mmdit are comparing it to the libraries listed below
Sorting:
- ☆23Updated last year
- ☆32Updated 11 months ago
- ☆27Updated last year
- A JAX implementation of the continuous time formulation of Consistency Models☆85Updated 2 years ago
- ☆34Updated last year
- research impl of Native Sparse Attention (2502.11089)☆61Updated 7 months ago
- ☆19Updated 4 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆56Updated 6 months ago
- Focused on fast experimentation and simplicity☆75Updated 9 months ago
- ☆91Updated 3 years ago
- ☆39Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆85Updated last year
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆57Updated last year
- ☆53Updated last year
- Writing FLUX in Triton☆40Updated last year
- ☆21Updated 11 months ago
- ☆24Updated last year
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Updated 11 months ago
- The 2D discrete wavelet transform for JAX☆43Updated 2 years ago
- Combining SOAP and MUON☆16Updated 7 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 8 months ago
- Utilities for PyTorch distributed☆25Updated 7 months ago
- ☆23Updated 9 months ago
- ☆49Updated 7 months ago
- ☆24Updated 5 months ago
- A demo for the Direct Ascent Synthesis: Hidden Generative Capabilities in Discriminative Models paper (https://arxiv.org/abs/2502.07753)☆40Updated 7 months ago
- Latent Diffusion Language Models☆68Updated 2 years ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆58Updated 10 months ago
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…☆47Updated last month
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆53Updated 8 months ago