Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
☆176 · Mar 24, 2026 · Updated this week
Alternatives and similar repositories for grove
Users that are interested in grove are comparing it to the libraries listed below.
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆152 · Mar 19, 2026 · Updated last week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale ☆1,191 · Updated this week
- Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and i… ☆43 · Updated this week
- ☆290 · Mar 19, 2026 · Updated last week
- ☆232 · Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. ☆76 · Jul 18, 2025 · Updated 8 months ago
- A toolkit for discovering cluster network topology. ☆104 · Mar 21, 2026 · Updated last week
- A lightweight, configurable, and real-time simulator designed to mimic the behavior of vLLM without the need for GPUs or running actual h… ☆103 · Mar 19, 2026 · Updated last week
- Holistic job manager on Kubernetes ☆116 · Feb 20, 2024 · Updated 2 years ago
- Agent skills for vLLM ☆48 · Mar 3, 2026 · Updated 3 weeks ago
- NVIDIA Inference Xfer Library (NIXL) ☆957 · Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆682 · Mar 21, 2026 · Updated last week
- WG Serving ☆34 · Mar 5, 2026 · Updated 3 weeks ago
- ☆17 · Jul 18, 2025 · Updated 8 months ago
- Gateway API Inference Extension ☆616 · Updated this week
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling. ☆19 · May 30, 2025 · Updated 9 months ago
- A high-performance and light-weight router for vLLM large scale deployment ☆160 · Mar 20, 2026 · Updated last week
- Load & manage evolving datasets efficiently ☆23 · Aug 22, 2025 · Updated 7 months ago
- Community maintained hardware plugin for vLLM on AWS Neuron ☆27 · Mar 20, 2026 · Updated last week
- CPU DRA Driver ☆36 · Mar 20, 2026 · Updated last week
- NVIDIA DRA Driver for GPUs ☆593 · Updated this week
- Simplified model deployment on llm-d ☆28 · Jul 2, 2025 · Updated 8 months ago
- A Triton JIT runtime and ffi provider in C++ ☆32 · Updated this week
- Trivy plugin for OCI referrers ☆23 · May 13, 2024 · Updated last year
- Documentation repository for NVIDIA Cloud Native Technologies ☆37 · Updated this week
- Simple tokenised template system for SGE ☆10 · Mar 1, 2023 · Updated 3 years ago
- Kubernetes KMS implementation ☆26 · Mar 20, 2026 · Updated last week
- InstaSlice Operator facilitates slicing of accelerators using stable APIs ☆51 · Updated this week
- Achieve state of the art inference performance with modern accelerators on Kubernetes ☆2,657 · Mar 21, 2026 · Updated last week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI… ☆26 · Dec 6, 2024 · Updated last year
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer ☆172 · Feb 11, 2026 · Updated last month
- Operator for the mutating admission webhook for ClusterResourceOverride ☆18 · Mar 13, 2026 · Updated 2 weeks ago
- markdown docs ☆95 · Feb 1, 2026 · Updated last month
- Augmented Dickey-Fuller implementation in Go ☆12 · Mar 15, 2019 · Updated 7 years ago
- [DEPRECATED] Prometheus exporter for VPA recommendations ☆12 · Aug 22, 2023 · Updated 2 years ago
- parse Grid Engine qstat job info list into a list of python dicts ☆11 · May 19, 2022 · Updated 3 years ago
- Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T… ☆404 · Updated this week
- Bridge CEA In-House Batch Environment gives a uniform way to access external Batch scheduling systems. ☆16 · Dec 8, 2025 · Updated 3 months ago
- tensorflow fork with Salus integration ☆12 · Jan 7, 2022 · Updated 4 years ago