AI-Hypercomputer / ml-goodput-measurement
☆15Updated 2 weeks ago
Alternatives and similar repositories for ml-goodput-measurement
Users that are interested in ml-goodput-measurement are comparing it to the libraries listed below
Sorting:
- Experimenting with how best to do multi-host dataloading☆10Updated 2 years ago
- ☆138Updated 2 weeks ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆60Updated last month
- An implementation of the Llama architecture, to instruct and delight☆21Updated 4 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆9Updated 2 weeks ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆32Updated 6 months ago
- Two implementations of ZeRO-1 optimizer sharding in JAX☆14Updated last year
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆120Updated last week
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated this week
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆44Updated 11 months ago
- Blazing fast data loading with HuggingFace Dataset and Ray Data☆16Updated last year
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆24Updated 7 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated last month
- DPO, but faster 🚀☆42Updated 5 months ago
- A sample pattern for running CI tests on Modal☆17Updated last month
- ☆21Updated 2 months ago
- ☆17Updated this week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆63Updated this week
- Machine Learning eXperiment Utilities☆46Updated 11 months ago
- ☆19Updated last month
- ☆31Updated last month
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12Updated 11 months ago
- NanoGPT (124M) in 5 minutes☆10Updated 3 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 9 months ago
- ☆27Updated last week
- This is a port of Mistral-7B model in JAX☆32Updated 10 months ago
- Triton Server Component for lightning.ai☆14Updated 2 years ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆45Updated 9 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 5 months ago
- JAX Scalify: end-to-end scaled arithmetics☆16Updated 6 months ago