A toolkit to run Ray applications on Kubernetes
☆2,341Feb 23, 2026Updated last week
Alternatives and similar repositories for kuberay
Users that are interested in kuberay are comparing it to the libraries listed below
Sorting:
- Kubernetes-native Job Queueing☆2,329Updated this week
- A Cloud Native Batch System (Project under CNCF)☆5,340Updated this week
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes☆5,135Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆673Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆41,516Updated this week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,549Updated this week
- Apache YuniKorn Core☆1,002Updated this week
- NVIDIA device plugin for Kubernetes☆3,671Updated this week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.☆6,754Feb 21, 2026Updated last week
- Cost-efficient and pluggable Infrastructure components for GenAI inference☆4,650Updated this week
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,035Updated this week
- Heterogeneous AI Computing Virtualization Middleware(Project under CNCF)☆3,032Feb 18, 2026Updated last week
- Apache YuniKorn K8shim☆163Feb 10, 2026Updated 2 weeks ago
- Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)☆1,890Updated this week
- A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, …☆1,660Updated this week
- Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration☆5,314Updated this week
- Machine Learning Toolkit for Kubernetes☆15,462Jan 5, 2026Updated last month
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆368Feb 1, 2026Updated last month
- A Datacenter Scale Distributed Inference Serving Framework☆6,154Updated this week
- RayLLM - LLMs on Ray (Archived). Read README for more info.☆1,267Mar 13, 2025Updated 11 months ago
- Kubebuilder - SDK for building Kubernetes APIs using CRDs☆8,994Updated this week
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆1,626Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,144Updated this week
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,276Dec 5, 2025Updated 2 months ago
- NVIDIA DRA Driver for GPUs☆574Updated this week
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, o…☆9,478Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆314Updated this week
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,155Feb 23, 2026Updated last week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,730Feb 16, 2026Updated 2 weeks ago
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.☆3,106Updated this week
- KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes☆9,938Updated this week
- Gateway API Inference Extension☆594Updated this week
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆268Feb 18, 2026Updated last week
- Autoscaling components for Kubernetes☆8,773Updated this week
- a unified scheduler for online and offline tasks☆644Mar 26, 2025Updated 11 months ago
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆531Mar 4, 2024Updated last year
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,393Updated this week
- Workflow Engine for Kubernetes☆16,481Updated this week
- Automated management of large-scale applications on Kubernetes (incubating project under CNCF)☆5,191Updated this week