AI-Hypercomputer / pathways-utilsLinks
Package of Pathways-on-Cloud utilities
☆21Updated last week
Alternatives and similar repositories for pathways-utils
Users that are interested in pathways-utils are comparing it to the libraries listed below
Sorting:
- ☆54Updated this week
- ☆25Updated last week
- torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JA…☆128Updated last week
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆153Updated this week
- ☆15Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best!☆61Updated this week
- ☆57Updated this week
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆15Updated 11 months ago
- ☆15Updated last month
- PathwaysJob API is an OSS Kubernetes-native API, to deploy ML training and batch inference workloads, using Pathways on GKE.☆15Updated last month
- ☆21Updated 8 months ago
- PyTorch centric eager mode debugger☆48Updated 11 months ago
- Parallel framework for training and fine-tuning deep neural networks☆69Updated 2 weeks ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆20Updated 10 months ago
- ☆16Updated last year
- A collection of reproducible inference engine benchmarks☆37Updated 7 months ago
- Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support☆187Updated this week
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆17Updated 3 years ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Updated this week
- ☆11Updated 11 months ago
- A set of Python scripts that makes your experience on TPU better☆54Updated 2 months ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆78Updated 2 months ago
- Some microbenchmarks and design docs before commencement☆12Updated 4 years ago
- 👷 Build compute kernels☆186Updated this week
- Simple repository for training small reasoning models☆46Updated 9 months ago
- train with kittens!☆63Updated last year
- Two implementations of ZeRO-1 optimizer sharding in JAX☆14Updated 2 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Updated 4 months ago
- ☆147Updated 3 weeks ago
- How to ensure correctness and ship LLM generated kernels in PyTorch☆121Updated 2 weeks ago