silogen / kaiwoLinks
AI Workload Orchestrator for Kubernetes
☆18Updated last week
Alternatives and similar repositories for kaiwo
Users that are interested in kaiwo are comparing it to the libraries listed below
Sorting:
- Kubernetes operator which sets up all platform tools to have a cluster ready for applications to run.☆17Updated last week
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆412Updated this week
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆216Updated last week
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.☆158Updated 2 months ago
- A top-like tool for monitoring GPUs in a cluster☆84Updated last year
- 🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry.☆158Updated last month
- ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation ✨ and compute time-slicing☆88Updated 2 months ago
- Module, Model, and Tensor Serialization/Deserialization☆286Updated 5 months ago
- Ray-based Apache Beam runner☆42Updated 2 years ago
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆503Updated this week
- Tracking Ray Enhancement Proposals☆63Updated last month
- Ray provider for Apache Airflow☆47Updated 2 years ago
- Utility for measuring the fraction of time the CPython GIL is held☆122Updated 4 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆482Updated 2 months ago
- CUDA checkpoint and restore utility☆410Updated 4 months ago
- Repository for open inference protocol specification☆64Updated 8 months ago
- A module for lazy loading of Python modules☆88Updated 2 years ago
- ClearML - Model-Serving Orchestration and Repository Solution☆161Updated last month
- fsspec filesystem for Alibaba Cloud (Aliyun) Object Storage System (OSS)☆25Updated 9 months ago
- Provide Python access to the NVML library for GPU diagnostics☆258Updated 5 months ago
- GPU environment and cluster management with LLM support☆656Updated last year
- ☆44Updated this week
- Python bindings for UCX☆139Updated 4 months ago
- Command line tool and async library to perform basic file operations on local paths, Google Cloud Storage paths and Azure Blob Storage pa…☆36Updated last week
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆267Updated last week
- cuVS - a library for vector search and clustering on the GPU☆624Updated this week
- Container plugin for Slurm Workload Manager☆412Updated 3 weeks ago
- The Triton backend for the ONNX Runtime.☆172Updated this week
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs☆658Updated 2 months ago
- ☆61Updated last year