Run your deep learning workloads on Kubernetes more easily and efficiently.
☆531Mar 4, 2024Updated 2 years ago
Alternatives and similar repositories for kubedl
Users that are interested in kubedl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Automatic tuning for ML model deployment on Kubernetes☆80Nov 1, 2024Updated last year
- Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)☆1,909Updated this week
- Kubernetes Operator for AI and Bigdata Elastic Training☆91Jan 10, 2025Updated last year
- A Cloud Native Batch System (Project under CNCF)☆5,440Apr 10, 2026Updated last week
- A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, …☆1,672Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A CLI for Kubeflow.☆809Updated this week
- GPU Sharing Scheduler for Kubernetes Cluster☆1,531Dec 29, 2023Updated 2 years ago
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,283Mar 19, 2026Updated 3 weeks ago
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,081Apr 10, 2026Updated last week
- Kubernetes-native Deep Learning Framework☆745Jan 26, 2024Updated 2 years ago
- ☆893Apr 2, 2024Updated 2 years ago
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC☆1,093May 22, 2023Updated 2 years ago
- GPU Sharing Device Plugin for Kubernetes Cluster☆493Jan 10, 2023Updated 3 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆524Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Docker for Your ML/DL Models Based on OCI Artifacts☆474Jan 26, 2024Updated 2 years ago
- ☆539Jun 7, 2024Updated last year
- Common APIs and libraries shared by other Kubeflow operator repositories.☆53May 28, 2023Updated 2 years ago
- GPU-scheduler-for-deep-learning☆209Nov 5, 2020Updated 5 years ago
- Automated management of large-scale applications on Kubernetes (incubating project under CNCF)☆5,220Apr 8, 2026Updated last week
- Cloud Native Machine Learning Model Registry☆81Jan 12, 2023Updated 3 years ago
- Fault-tolerant for DL frameworks☆71Jul 5, 2023Updated 2 years ago
- Kubernetes Scheduler for Deep Learning☆263May 22, 2022Updated 3 years ago
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆123Nov 1, 2022Updated 3 years ago
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.☆202Mar 24, 2022Updated 4 years ago
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆32May 19, 2023Updated 2 years ago
- Katalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This is t…☆545Apr 8, 2026Updated last week
- Device plugins for Volcano, e.g. GPU☆135Mar 20, 2025Updated last year
- OpenYurt - Extending your native Kubernetes to edge(project under CNCF)☆1,948Updated this week
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆33Nov 11, 2023Updated 2 years ago
- Apache YuniKorn Core☆1,006Updated this week
- Set of Kubernetes solutions for reusing idle resources of nodes by running extra batch jobs☆360Jul 7, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Kubernetes Rdma SRIOV device plugin☆113Dec 30, 2020Updated 5 years ago
- Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration☆5,427Updated this week
- RDMA CNI plugin for containerized workloads☆60Mar 31, 2026Updated 2 weeks ago
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆271Mar 31, 2023Updated 3 years ago
- Cloud Native ML/DL Platform☆132Sep 9, 2020Updated 5 years ago
- Automated Machine Learning on Kubernetes☆1,679Updated this week
- JuiceFS CSI Driver☆290Apr 8, 2026Updated last week