google / pathways-job
PathwaysJob API is an open-source, Kubernetes-native API for deploying ML training and batch inference workloads using Pathways on GKE.
☆15 Updated last month
Alternatives and similar repositories for pathways-job
Users who are interested in pathways-job are comparing it to the libraries listed below.
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large … ☆65 Updated 3 years ago
- A lightweight, user-friendly data-plane for LLM training. ☆37 Updated 2 months ago
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the … ☆235 Updated last week
- High-performance safetensors model loader ☆74 Updated last week
- Effective transpose on Hopper GPU ☆26 Updated 2 months ago
- torchcomms: a modern PyTorch communications API ☆295 Updated this week
- ☆72 Updated 9 months ago
- JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training ☆57 Updated last week
- extensible collectives library in triton ☆91 Updated 7 months ago
- JAX backend for SGL ☆185 Updated this week
- How to ensure correctness and ship LLM generated kernels in PyTorch ☆121 Updated 2 weeks ago
- ☆14 Updated 3 weeks ago
- Triton-based Symmetric Memory operators and examples ☆63 Updated last month
- MLIR-based partitioning system ☆150 Updated this week
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆61 Updated this week
- Perplexity open source garden for inference technology ☆274 Updated last week
- Fast low-bit matmul kernels in Triton ☆398 Updated last week
- Package of Pathways-on-Cloud utilities ☆21 Updated last week
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components. ☆217 Updated last week
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆47 Updated 3 months ago
- ☆94 Updated last year
- ring-attention experiments ☆160 Updated last year
- Offline optimization of your disaggregated Dynamo graph ☆110 Updated this week
- TORCH_LOGS parser for PT2 ☆65 Updated 2 weeks ago
- ☆51 Updated this week
- A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node ☆43 Updated 2 months ago
- A bunch of kernels that might make stuff slower 😉 ☆65 Updated this week
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆125 Updated 2 months ago
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming ☆116 Updated last week
- ☆31 Updated 7 months ago