AI-Hypercomputer / maxdiffusion
☆199Updated 2 weeks ago
Alternatives and similar repositories for maxdiffusion:
Users that are interested in maxdiffusion are comparing it to the libraries listed below
- Google TPU optimizations for transformers models☆107Updated 2 months ago
- ☆186Updated this week
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆313Updated this week
- JAX implementation of the Llama 2 model☆217Updated last year
- Scalable and Performant Data Loading☆234Updated this week
- Focused on fast experimentation and simplicity☆71Updated 3 months ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆488Updated this week
- An implementation of PSGD Kron second-order optimizer for PyTorch☆86Updated 2 weeks ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆59Updated 2 weeks ago
- ☆137Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆103Updated 4 months ago
- ☆76Updated 9 months ago
- WIP☆93Updated 8 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆30Updated 5 months ago
- supporting pytorch FSDP for optimizers☆80Updated 4 months ago
- Inference code for LLaMA models in JAX☆116Updated 10 months ago
- This repository contains the experimental PyTorch native float8 training UX☆222Updated 8 months ago
- DeMo: Decoupled Momentum Optimization☆185Updated 4 months ago
- jax-triton contains integrations between JAX and OpenAI Triton☆389Updated this week
- Efficient optimizers☆188Updated this week
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆123Updated 11 months ago
- A simple library for scaling up JAX programs☆134Updated 5 months ago
- ☆302Updated 9 months ago
- Faster generation with text-to-image diffusion models.☆213Updated 6 months ago
- JAX-Toolbox☆298Updated this week
- ☆205Updated 2 months ago
- ☆97Updated this week
- PyTorch per step fault tolerance (actively under development)☆274Updated this week
- ☆59Updated 4 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆270Updated 10 months ago