A GPU / device extension framework for Kubernetes
☆365Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for KubeGPU
Users that are interested in KubeGPU are comparing it to the libraries listed below
Sorting:
- More Flexible Device Extension Capability in Kubernetes (DevicePlugins++)☆25Jun 12, 2023Updated 2 years ago
- [EOL] A Firmament-based Kubernetes scheduler☆408Jul 19, 2021Updated 4 years ago
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC☆1,094May 22, 2023Updated 2 years ago
- GPU Sharing Device Plugin for Kubernetes Cluster☆492Jan 10, 2023Updated 3 years ago
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,035Updated this week
- High performance container overlay networks on Linux. Enabling RDMA (on both InfiniBand and RoCE) and accelerating TCP to bare metal perf…☆632Jun 12, 2023Updated 2 years ago
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine☆248Updated this week
- RDMA device plugin for Kubernetes☆226Dec 15, 2023Updated 2 years ago
- A tool for developers to create cloud-native applications on Kubernetes.☆3,897Jun 12, 2024Updated last year
- Deep Learning Workspace☆204Jul 18, 2023Updated 2 years ago
- Ansible role to deploy NVIDIA driver for GPUs☆12Jan 21, 2020Updated 6 years ago
- GPU Sharing Scheduler for Kubernetes Cluster☆1,528Dec 29, 2023Updated 2 years ago
- Architecture and UX design of KAML-D☆14Apr 3, 2018Updated 7 years ago
- Simple, cloud native infrastructure for Kubernetes.☆1,683Oct 5, 2022Updated 3 years ago
- Kernel for Kubeflow in Jupyter Notebook☆65Aug 13, 2019Updated 6 years ago
- Toolkit for creating gRPC-based CLI and web tools for Kubernetes☆72Feb 28, 2018Updated 8 years ago
- Virtual Kubelet is an open source Kubernetes kubelet implementation.☆4,483Feb 16, 2026Updated last week
- Banzai Cloud Pipeline is a solution-oriented application platform which allows enterprises to develop, deploy and securely scale containe…☆1,511Nov 24, 2023Updated 2 years ago
- The hypervisor-based container runtime for Kubernetes.☆676Dec 15, 2020Updated 5 years ago
- This guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kube…☆817Oct 3, 2022Updated 3 years ago
- [EOL] Anonymous Usage Collector☆74Dec 21, 2018Updated 7 years ago
- Kubernetes Native Serverless Framework☆6,860Dec 15, 2021Updated 4 years ago
- GPU analyzer for Kubernetes GPU clusters☆17Apr 11, 2020Updated 5 years ago
- NVIDIA device plugin for Kubernetes☆3,671Updated this week
- Lightweight Kubernetes controllers as a service☆791Aug 31, 2020Updated 5 years ago
- Simulated large clusters for Kubernetes scheduler validation.☆15Jan 3, 2023Updated 3 years ago
- Kuberetes etcd network checkpointer☆11Jul 20, 2017Updated 8 years ago
- A socket server which reads events from an event source and forwards them to the user clients when appropriate☆18Feb 18, 2018Updated 8 years ago
- A debugger for Kubernetes applications.☆227Mar 8, 2019Updated 6 years ago
- KubeCon-CloudNativeCon-North-America-2018's slides. / 2018北美CNCF大会PPT。☆159May 22, 2019Updated 6 years ago
- ☆33Jun 11, 2018Updated 7 years ago
- Storage backend for Kubernetes using Go database/sql☆34May 28, 2019Updated 6 years ago
- Production grade Kubernetes controller for managing AWS Services using CRDs☆16Apr 8, 2020Updated 5 years ago
- Machine Learning Toolkit for Kubernetes☆15,462Jan 5, 2026Updated last month
- Sonobuoy is a diagnostic tool that makes it easier to understand the state of a Kubernetes cluster by running a set of Kubernetes conform…☆3,034Nov 25, 2025Updated 3 months ago
- 👩🔬 Train and Serve TensorFlow Models at Scale with Kubernetes and Kubeflow on Azure☆289Nov 13, 2020Updated 5 years ago
- Kubernetes Resource Explorer☆135Nov 4, 2018Updated 7 years ago
- Contour is a Kubernetes ingress controller using Envoy proxy.☆3,915Updated this week
- ⚠️(OBSOLETE) Search and discovery UI for Helm Chart repositories☆1,413Jun 21, 2021Updated 4 years ago