kelechi-c / dit_flowLinks
DiT (training + flow matching) in Jax
☆9Updated 6 months ago
Alternatives and similar repositories for dit_flow
Users that are interested in dit_flow are comparing it to the libraries listed below
Sorting:
- ☆132Updated 2 weeks ago
- ☆110Updated last month
- Minimal but scalable implementation of large language models in JAX☆35Updated last week
- Implementation of PSGD optimizer in JAX☆33Updated 6 months ago
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Updated 9 months ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆11Updated last year
- supporting pytorch FSDP for optimizers☆82Updated 7 months ago
- LoRA for arbitrary JAX models and functions☆140Updated last year
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆97Updated 6 months ago
- ☆197Updated 7 months ago
- A simple library for scaling up JAX programs☆139Updated 8 months ago
- ☆31Updated 7 months ago
- Lightning-like training API for JAX with Flax☆42Updated 7 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆84Updated last year
- 🧱 Modula software package☆204Updated 3 months ago
- Pytorch-like dataloaders for JAX.☆90Updated last month
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆129Updated last year
- Turn jitted jax functions back into python source code☆22Updated 7 months ago
- Einsum-like high-level array sharding API for JAX☆35Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆34Updated 8 months ago
- A State-Space Model with Rational Transfer Function Representation.☆79Updated last year
- Efficient optimizers☆234Updated this week
- ☆61Updated 8 months ago
- ☆32Updated last year
- ☆53Updated last year
- ☆17Updated 10 months ago
- ☆83Updated last week
- Implementation of GateLoop Transformer in Pytorch and Jax☆89Updated last year
- Jax/Flax rewrite of Karpathy's nanoGPT☆59Updated 2 years ago
- ☆81Updated 8 months ago