Run your deep learning workloads on Kubernetes more easily and efficiently.
☆531 · Mar 4, 2024 · Updated 2 years ago
Alternatives and similar repositories for kubedl
Users who are interested in kubedl are comparing it to the libraries listed below.
- Automatic tuning for ML model deployment on Kubernetes ☆80 · Nov 1, 2024 · Updated last year
- Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF) ☆1,892 · Updated this week
- Kubernetes Operator for AI and Bigdata Elastic Training ☆91 · Jan 10, 2025 · Updated last year
- A Cloud Native Batch System (Project under CNCF) ☆5,395 · Mar 19, 2026 · Updated last week
- A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, … ☆1,666 · Updated this week
- A CLI for Kubeflow. ☆810 · Mar 19, 2026 · Updated last week
- GPU Sharing Scheduler for Kubernetes Cluster ☆1,531 · Dec 29, 2023 · Updated 2 years ago
- Repository for out-of-tree scheduler plugins based on scheduler framework. ☆1,284 · Mar 19, 2026 · Updated last week
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes ☆2,063 · Mar 21, 2026 · Updated last week
- Kubernetes-native Deep Learning Framework ☆744 · Jan 26, 2024 · Updated 2 years ago
- ☆893 · Apr 2, 2024 · Updated last year
- A batch scheduler of Kubernetes for high performance workload, e.g. AI/ML, BigData, HPC ☆1,090 · May 22, 2023 · Updated 2 years ago
- GPU Sharing Device Plugin for Kubernetes Cluster ☆493 · Jan 10, 2023 · Updated 3 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.) ☆519 · Updated this week
- Docker for Your ML/DL Models Based on OCI Artifacts ☆472 · Jan 26, 2024 · Updated 2 years ago
- ☆540 · Jun 7, 2024 · Updated last year
- Common APIs and libraries shared by other Kubeflow operator repositories. ☆53 · May 28, 2023 · Updated 2 years ago
- GPU-scheduler-for-deep-learning ☆209 · Nov 5, 2020 · Updated 5 years ago
- Automated management of large-scale applications on Kubernetes (incubating project under CNCF) ☆5,206 · Mar 11, 2026 · Updated 2 weeks ago
- Cloud Native Machine Learning Model Registry ☆81 · Jan 12, 2023 · Updated 3 years ago
- Fault tolerance for DL frameworks ☆70 · Jul 5, 2023 · Updated 2 years ago
- Kubernetes Scheduler for Deep Learning ☆264 · May 22, 2022 · Updated 3 years ago
- NVIDIA device plugin for Kubernetes ☆15 · Sep 9, 2019 · Updated 6 years ago
- ☆123 · Nov 1, 2022 · Updated 3 years ago
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively. ☆202 · Mar 24, 2022 · Updated 4 years ago
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano ☆32 · May 19, 2023 · Updated 2 years ago
- Katalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This is t… ☆543 · Mar 20, 2026 · Updated last week
- Device plugins for Volcano, e.g. GPU ☆134 · Mar 20, 2025 · Updated last year
- OpenYurt - Extending your native Kubernetes to edge (project under CNCF) ☆1,948 · Mar 20, 2026 · Updated last week
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23) ☆33 · Nov 11, 2023 · Updated 2 years ago
- Apache YuniKorn Core ☆1,005 · Updated this week
- Set of Kubernetes solutions for reusing idle resources of nodes by running extra batch jobs ☆358 · Jul 7, 2025 · Updated 8 months ago
- Kubernetes RDMA SRIOV device plugin ☆113 · Dec 30, 2020 · Updated 5 years ago
- Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration ☆5,362 · Updated this week
- RDMA CNI plugin for containerized workloads ☆60 · Mar 18, 2026 · Updated last week
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training. ☆271 · Mar 31, 2023 · Updated 2 years ago
- Cloud Native ML/DL Platform ☆132 · Sep 9, 2020 · Updated 5 years ago
- JuiceFS CSI Driver ☆289 · Updated this week
- Automated Machine Learning on Kubernetes ☆1,671 · Updated this week