AI-Hypercomputer / cloud-accelerator-diagnostics
☆23 Updated 2 weeks ago
Alternatives and similar repositories for cloud-accelerator-diagnostics
Users who are interested in cloud-accelerator-diagnostics are comparing it to the libraries listed below.
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta… ☆535 Updated 2 weeks ago
- ☆330 Updated this week
- ☆146 Updated last month
- ☆188 Updated 2 weeks ago
- A JAX-native LLM Post-Training Library ☆143 Updated this week
- jax-triton contains integrations between JAX and OpenAI Triton ☆419 Updated 2 weeks ago
- Minimal yet performant LLM examples in pure JAX ☆158 Updated this week
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo) ☆401 Updated 2 weeks ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel… ☆375 Updated 3 months ago
- This repository contains the experimental PyTorch native float8 training UX ☆224 Updated last year
- JAX-Toolbox ☆335 Updated this week
- seqax = sequence modeling + JAX ☆167 Updated last month
- torchprime is a reference model implementation for PyTorch on TPU. ☆36 Updated this week
- Implementation of Flash Attention in Jax ☆216 Updated last year
- JAX Synergistic Memory Inspector ☆179 Updated last year
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash… ☆265 Updated last month
- ☆534 Updated last year
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆296 Updated last week
- ☆361 Updated last year
- Load compute kernels from the Hub ☆283 Updated this week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆658 Updated this week
- ☆118 Updated last year
- Accelerated First Order Parallel Associative Scan ☆188 Updated last year
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism. ☆139 Updated 5 months ago
- A library for unit scaling in PyTorch ☆130 Updated 2 months ago
- JAX implementation of the Llama 2 model ☆219 Updated last year
- JAX bindings for Flash Attention v2 ☆91 Updated last week
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference ☆71 Updated 5 months ago
- PyTorch Single Controller ☆414 Updated this week
- Everything you want to know about Google Cloud TPU ☆545 Updated last year