coreweave / ml-containers
☆21Updated this week
Related projects: ⓘ
- Module, Model, and Tensor Serialization/Deserialization☆175Updated 3 weeks ago
- NVIDIA NCCL Tests for Distributed Training☆59Updated last month
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆66Updated last week
- Model compression for ONNX☆67Updated last week
- Repository for open inference protocol specification☆41Updated 2 months ago
- The Triton backend for the PyTorch TorchScript models.☆117Updated last week
- A top-like tool for monitoring GPUs in a cluster☆80Updated 7 months ago
- ☆43Updated 3 months ago
- CUDA checkpoint and restore utility☆193Updated 5 months ago
- Controller for ModelMesh☆200Updated 2 months ago
- ☆170Updated this week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆95Updated this week
- OpenVINO backend for Triton.☆29Updated 2 weeks ago
- ☆158Updated this week
- MLPerf™ logging library☆30Updated last week
- ModelMesh Performance Scripts, Dashboard and Pipelines☆10Updated last year
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆121Updated this week
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆178Updated last week
- MIG Partition Editor for NVIDIA GPUs☆163Updated this week
- Tools to deploy GPU clusters in the Cloud☆30Updated last year
- ☆225Updated last month
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.☆62Updated this week
- Common source, scripts and utilities shared across all Triton repositories.☆62Updated 2 weeks ago
- markdown docs☆62Updated this week
- Run cloud native workloads on NVIDIA GPUs☆124Updated 2 weeks ago
- The Triton backend for the ONNX Runtime.☆122Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inference☆52Updated this week
- Intelligent platform for AI workloads☆37Updated last year
- Distributed Model Serving Framework☆147Updated 2 weeks ago
- Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods☆21Updated 2 months ago