AI-Hypercomputer / maxdiffusion
☆197 · Updated this week
Alternatives and similar repositories for maxdiffusion:
Users interested in maxdiffusion are comparing it to the libraries listed below:
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference ☆54 · Updated last month
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome) ☆299 · Updated this week
- Google TPU optimizations for transformers models ☆104 · Updated 2 months ago
- Minimal (400 LOC) implementation of Maximum (multi-node, FSDP) GPT training ☆122 · Updated 11 months ago
- Focused on fast experimentation and simplicity ☆69 · Updated 3 months ago
- JAX implementation of the Llama 2 model ☆216 · Updated last year
- JAX-Toolbox ☆289 · Updated this week
- PyTorch per-step fault tolerance (actively under development) ☆267 · Updated this week
- Scalable and Performant Data Loading ☆230 · Updated this week
- Implementation of Flash Attention in JAX (a blockwise online-softmax sketch follows this list) ☆206 · Updated last year
- Efficient optimizers ☆184 · Updated 2 weeks ago
- This repository contains the experimental PyTorch native float8 training UX ☆222 · Updated 7 months ago
- ring-attention experiments ☆128 · Updated 5 months ago
- Inference code for LLaMA models in JAX ☆116 · Updated 10 months ago
- jax-triton contains integrations between JAX and OpenAI Triton ☆386 · Updated last week
- Pax is a JAX-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation ☆484 · Updated last week
- Supporting PyTorch FSDP for optimizers ☆79 · Updated 3 months ago
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash Attention ☆232 · Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆100 · Updated 4 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆125 · Updated 3 months ago
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch (a JAX sketch of the ring communication pattern follows this list) ☆506 · Updated 4 months ago
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff" ☆223 · Updated last month
- Simple implementation of muP, based on the Spectral Condition for Feature Learning. The implementation is SGD only; don't use it for Adam (see the scaling-rule sketch after this list) ☆73 · Updated 7 months ago
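
A few of the entries above name techniques that a one-line description undersells, so brief sketches follow. First, the "Flash Attention in JAX" entry: Flash Attention's core idea is blockwise attention with an online softmax, so the full [seq, seq] score matrix is never materialized. Below is a minimal pure-JAX sketch of that bookkeeping; the function name and block size are illustrative, not that repository's API, and real Flash Attention kernels additionally fuse this loop on-chip.

```python
import jax
import jax.numpy as jnp

def blockwise_attention(q, k, v, block_size=128):
    """q, k, v: [seq_len, head_dim]. K/V are consumed one block at a time."""
    seq_len, head_dim = q.shape
    assert seq_len % block_size == 0, "sketch assumes divisible seq_len"
    scale = head_dim ** -0.5
    num_blocks = seq_len // block_size

    def eat_kv_block(carry, kv_block):
        m_prev, l_prev, o_prev = carry           # running max, normalizer, output
        k_blk, v_blk = kv_block
        s = (q @ k_blk.T) * scale                # [seq, block] partial scores
        m_new = jnp.maximum(m_prev, s.max(axis=-1, keepdims=True))
        p = jnp.exp(s - m_new)                   # probs relative to the new max
        corr = jnp.exp(m_prev - m_new)           # rescale the old accumulators
        l_new = l_prev * corr + p.sum(axis=-1, keepdims=True)
        o_new = o_prev * corr + p @ v_blk
        return (m_new, l_new, o_new), None

    k_blocks = k.reshape(num_blocks, block_size, head_dim)
    v_blocks = v.reshape(num_blocks, block_size, head_dim)
    init = (jnp.full((seq_len, 1), -jnp.inf),    # running max starts at -inf
            jnp.zeros((seq_len, 1)),             # softmax normalizer
            jnp.zeros((seq_len, head_dim)))      # unnormalized output
    (m, l, o), _ = jax.lax.scan(eat_kv_block, init, (k_blocks, v_blocks))
    return o / l                                 # final softmax normalization
```

(Non-causal for brevity; a causal variant additionally masks each score block.)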
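
The 💍 Ring Attention entry distributes the same accumulators across devices: each device keeps its query shard fixed while the K/V shards rotate around the device ring. Here is a hedged sketch of just that communication pattern, assuming one sequence block per device; `make_ring_attention` and the pmap usage are illustrative, not code from that repository.

```python
import jax
import jax.numpy as jnp

def make_ring_attention(n_dev, axis_name="ring"):
    # Static ring permutation: device i sends its K/V shard to device i+1.
    perm = [(i, (i + 1) % n_dev) for i in range(n_dev)]

    def ring_attention_shard(q, k, v):
        # q, k, v: this device's [block, head_dim] shards of the sequence.
        scale = q.shape[-1] ** -0.5
        m = jnp.full((q.shape[0], 1), -jnp.inf)
        l = jnp.zeros((q.shape[0], 1))
        o = jnp.zeros_like(q)
        for _ in range(n_dev):                   # one full pass around the ring
            s = (q @ k.T) * scale
            m_new = jnp.maximum(m, s.max(axis=-1, keepdims=True))
            p = jnp.exp(s - m_new)
            corr = jnp.exp(m - m_new)
            l = l * corr + p.sum(axis=-1, keepdims=True)
            o = o * corr + p @ v
            m = m_new
            # Hand our K/V shard to the next device, receive from the previous.
            k = jax.lax.ppermute(k, axis_name, perm)
            v = jax.lax.ppermute(v, axis_name, perm)
        return o / l

    return ring_attention_shard

# Usage sketch: shards of shape [n_dev, block, head_dim], one block per device.
# attn = jax.pmap(make_ring_attention(jax.local_device_count()), axis_name="ring")
# out = attn(q_shards, k_shards, v_shards)
```

After n_dev rotations each shard is back on its owner; a real implementation also overlaps the ppermute with the block computation.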
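
Finally, the muP entry's "SGD only" warning comes from the Spectral Condition it cites: each weight matrix should keep spectral norm on the order of sqrt(fan_out / fan_in), both at initialization and across updates, and for SGD that works out to scaling the per-layer learning rate by fan_out / fan_in (the corresponding Adam rule is different, hence the warning). A minimal sketch under those assumptions; both function names are illustrative.

```python
import jax

def spectral_init(key, fan_in, fan_out):
    """Gaussian init targeting spectral norm ~ sqrt(fan_out / fan_in).
    A Gaussian matrix with entry std s has spectral norm roughly
    s * (sqrt(fan_in) + sqrt(fan_out)), so solve for s."""
    target = (fan_out / fan_in) ** 0.5
    std = target / (fan_in ** 0.5 + fan_out ** 0.5)
    return std * jax.random.normal(key, (fan_in, fan_out))

def spectral_sgd_lr(base_lr, fan_in, fan_out):
    """Per-layer SGD learning rate; the fan_out / fan_in factor keeps the
    *updates* on the same spectral scale as the weights. SGD only."""
    return base_lr * fan_out / fan_in
```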