A toolkit to run Ray applications on Kubernetes
☆2,388Mar 21, 2026Updated this week
Alternatives and similar repositories for kuberay
Users that are interested in kuberay are comparing it to the libraries listed below
Sorting:
- Apache YuniKorn K8shim☆163Updated this week
- Kubernetes-native Job Queueing☆2,368Updated this week
- A Cloud Native Batch System (Project under CNCF)☆5,395Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆41,799Updated this week
- Apache YuniKorn Core☆1,004Updated this week
- Helm charts for the KubeRay project☆60Mar 11, 2026Updated last week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆682Updated this week
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes☆5,216Updated this week
- Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows. Flyte 2 now available locally: https…☆6,896Updated this week
- NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes☆2,598Updated this week
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,056Updated this week
- NVIDIA device plugin for Kubernetes☆3,706Updated this week
- Cost-efficient and pluggable Infrastructure components for GenAI inference☆4,682Updated this week
- RayLLM - LLMs on Ray (Archived). Read README for more info.☆1,266Mar 13, 2025Updated last year
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆370Updated this week
- Heterogeneous GPU Sharing on Kubernetes☆3,110Updated this week
- A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, …☆1,665Updated this week
- Machine Learning Toolkit for Kubernetes☆15,527Jan 5, 2026Updated 2 months ago
- Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)☆1,891Updated this week
- A Datacenter Scale Distributed Inference Serving Framework☆6,347Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,181Updated this week
- Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration☆5,336Updated this week
- Tracking Ray Enhancement Proposals☆69Dec 17, 2025Updated 3 months ago
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆270Mar 12, 2026Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆321Updated this week
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,284Updated this week
- NVIDIA GPU metrics exporter for Prometheus leveraging DCGM☆1,648Feb 25, 2026Updated 3 weeks ago
- Kubebuilder - SDK for building Kubernetes APIs using CRDs☆9,036Updated this week
- NVIDIA DRA Driver for GPUs☆585Updated this week
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ cl…☆9,664Updated this week
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆531Mar 4, 2024Updated 2 years ago
- AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…☆1,165Feb 23, 2026Updated 3 weeks ago
- a unified scheduler for online and offline tasks☆649Mar 2, 2026Updated 3 weeks ago
- Gateway API Inference Extension☆609Mar 15, 2026Updated last week
- World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.☆2,924Updated this week
- Automated management of large-scale applications on Kubernetes (incubating project under CNCF)☆5,200Mar 11, 2026Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆73,479Updated this week
- ☆341Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,446Updated this week