Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
☆194Apr 14, 2026Updated this week
Alternatives and similar repositories for grove
Users that are interested in grove are comparing it to the libraries listed below.
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆154Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,233Updated this week
- Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and i…☆53Updated this week
- ☆297Mar 19, 2026Updated last month
- ☆243Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆76Updated this week
- A toolkit for discovering cluster network topology.☆115Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆697Updated this week
- NVIDIA Inference Xfer Library (NIXL)☆985Updated this week
- A lightweight, configurable, and real-time simulator designed to mimic the behavior of vLLM without the need for GPUs or running actual h…☆116Updated this week
- Holistic job manager on Kubernetes☆116Feb 20, 2024Updated 2 years ago
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13Dec 5, 2025Updated 4 months ago
- WG Serving☆34Mar 24, 2026Updated 3 weeks ago
- Prototype of MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism☆29Apr 4, 2025Updated last year
- ☆19Apr 12, 2026Updated last week
- Agent skills for vLLM☆59Apr 3, 2026Updated 2 weeks ago
- Gateway API Inference Extension☆639Apr 10, 2026Updated last week
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.☆19May 30, 2025Updated 10 months ago
- Load & manage evolving datasets efficiently☆23Aug 22, 2025Updated 7 months ago
- A high-performance and light-weight router for vLLM large scale deployment☆198Updated this week
- CPU DRA Driver☆46Apr 11, 2026Updated last week
- Community maintained hardware plugin for vLLM on AWS Neuron☆28Mar 20, 2026Updated 3 weeks ago
- Distributed KV cache scheduling & offloading libraries☆130Updated this week
- DRA Driver for NVIDIA GPUs☆626Updated this week
- Simplified model deployment on llm-d☆28Jul 2, 2025Updated 9 months ago
- Intelligent platform for AI workloads☆37Jan 24, 2023Updated 3 years ago
- A Triton JIT runtime and ffi provider in C++☆32Apr 10, 2026Updated last week
- Simple tokenised template system for SGE☆10Mar 1, 2023Updated 3 years ago
- Kubernetes KMS implementation☆27Apr 9, 2026Updated last week
- Documentation repository for NVIDIA Cloud Native Technologies☆37Updated this week
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆51Updated this week
- GPU Environment Management for JupyterLab☆26Feb 19, 2024Updated 2 years ago
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it a unified cache system with POSI…☆26Dec 6, 2024Updated last year
- Achieve state of the art inference performance with modern accelerators on Kubernetes☆3,015Updated this week
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆177Feb 11, 2026Updated 2 months ago
- Operator for the mutating admission webhook for ClusterResourceOverride☆19Apr 8, 2026Updated last week
- markdown docs☆96Feb 1, 2026Updated 2 months ago
- Augmented Dickey-Fuller implementation in Go☆12Mar 15, 2019Updated 7 years ago
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆122Apr 12, 2026Updated last week