Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
☆201May 3, 2026Updated last week
Alternatives and similar repositories for grove
Users that are interested in grove are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆157Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,255Updated this week
- ☆302Apr 30, 2026Updated last week
- Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and i…☆58Updated this week
- ☆248Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆77Apr 14, 2026Updated 3 weeks ago
- A toolkit for discovering cluster network topology.☆124Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆715Updated this week
- NVIDIA Inference Xfer Library (NIXL)☆1,022Updated this week
- A lightweight, configurable, and real-time simulator designed to mimic the behavior of vLLM without the need for GPUs or running actual h…☆123Apr 30, 2026Updated last week
- Holistic job manager on Kubernetes☆117Feb 20, 2024Updated 2 years ago
- Manage ML configuration with pydantic☆16Mar 18, 2026Updated last month
- Workshop materials for AI Engineer World's Fair☆16Jun 3, 2025Updated 11 months ago
- High-performance distributed data shuffling (all-to-all) library for MoE training and inference☆120Mar 7, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- WG Serving☆35Mar 24, 2026Updated last month
- Prototyp MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism☆30Apr 4, 2025Updated last year
- ☆19Apr 12, 2026Updated 3 weeks ago
- Gateway API Inference Extension☆660May 2, 2026Updated last week
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.☆19May 30, 2025Updated 11 months ago
- Agent skills for vLLM☆67Apr 3, 2026Updated last month
- Load & manage evolving datasets efficiently☆23Aug 22, 2025Updated 8 months ago
- Community maintained hardware plugin for vLLM on AWS Neuron☆29Mar 20, 2026Updated last month
- A high-performance and light-weight router for vLLM large scale deployment☆214Apr 30, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Distributed KV cache scheduling & offloading libraries☆140May 1, 2026Updated last week
- DRA Driver for NVIDIA GPUs☆637Updated this week
- Simplified model deployment on llm-d☆28Jul 2, 2025Updated 10 months ago
- https://hf.co/hexgrad/Kokoro-82M☆14Jan 14, 2026Updated 3 months ago
- Simple tokenised template system for SGE☆10Mar 1, 2023Updated 3 years ago
- Kubernetes KMS implementation☆27Apr 24, 2026Updated 2 weeks ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆51Updated this week
- Documentation repository for NVIDIA Cloud Native Technologies☆38Updated this week
- GPU Environment Management for JupyterLab☆26Feb 19, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Achieve state of the art inference performance with modern accelerators on Kubernetes☆3,148Updated this week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- Operator for the mutating admission webhook for ClusterResourceOverride☆19Apr 15, 2026Updated 3 weeks ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆181Feb 11, 2026Updated 2 months ago
- markdown docs☆96Feb 1, 2026Updated 3 months ago
- Augmented Dickey-Fuller implementation in Go☆12Mar 15, 2019Updated 7 years ago
- ☆16Updated this week