AI-Hypercomputer / maxdiffusionLinks
☆262Updated last week
Alternatives and similar repositories for maxdiffusion
Users that are interested in maxdiffusion are comparing it to the libraries listed below
Sorting:
- Google TPU optimizations for transformers models☆120Updated 8 months ago
- Dion optimizer algorithm☆360Updated this week
- a Jax quantization library☆46Updated this week
- ☆89Updated last year
- Scalable and Performant Data Loading☆304Updated 2 weeks ago
- JAX-Toolbox☆343Updated this week
- Load compute kernels from the Hub☆290Updated last week
- ☆309Updated last year
- ☆146Updated last week
- Focused on fast experimentation and simplicity☆75Updated 9 months ago
- ☆331Updated 3 weeks ago
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆414Updated this week
- ☆188Updated last week
- JAX implementation of the Llama 2 model☆218Updated last year
- Efficient optimizers☆265Updated this week
- ☆67Updated 10 months ago
- A library for unit scaling in PyTorch☆130Updated 2 months ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆73Updated 3 weeks ago
- This repository contains the experimental PyTorch native float8 training UX☆224Updated last year
- DeMo: Decoupled Momentum Optimization☆193Updated 10 months ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆536Updated last month
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- ☆281Updated last year
- Implementation of Flash Attention in Jax☆219Updated last year
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆379Updated 3 months ago
- supporting pytorch FSDP for optimizers☆84Updated 9 months ago
- A simple library for scaling up JAX programs☆143Updated 11 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆46Updated last year
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆38Updated last month
- An implementation of PSGD Kron second-order optimizer for PyTorch☆97Updated 2 months ago