kubedl-io / kubedlView external linksLinks
Run your deep learning workloads on Kubernetes more easily and efficiently.
☆532Mar 4, 2024Updated last year
Alternatives and similar repositories for kubedl
Users that are interested in kubedl are comparing it to the libraries listed below
Sorting:
- Automatic tuning for ML model deployment on Kubernetes☆80Nov 1, 2024Updated last year
- Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)☆1,890Updated this week
- Kubernetes Operator for AI and Bigdata Elastic Training☆91Jan 10, 2025Updated last year
- A Cloud Native Batch System (Project under CNCF)☆5,320Feb 9, 2026Updated last week
- GPU Sharing Scheduler for Kubernetes Cluster☆1,527Dec 29, 2023Updated 2 years ago
- A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, …☆1,656Updated this week
- A CLI for Kubeflow.☆809Updated this week
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,271Dec 5, 2025Updated 2 months ago
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC☆1,094May 22, 2023Updated 2 years ago
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,028Updated this week
- ☆892Apr 2, 2024Updated last year
- Kubernetes-native Deep Learning Framework☆746Jan 26, 2024Updated 2 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆514Updated this week
- GPU Sharing Device Plugin for Kubernetes Cluster☆492Jan 10, 2023Updated 3 years ago
- Docker for Your ML/DL Models Based on OCI Artifacts☆474Jan 26, 2024Updated 2 years ago
- ☆540Jun 7, 2024Updated last year
- GPU-scheduler-for-deep-learning☆210Nov 5, 2020Updated 5 years ago
- Automated management of large-scale applications on Kubernetes (incubating project under CNCF)☆5,177Jan 30, 2026Updated 2 weeks ago
- Kubernetes Scheduler for Deep Learning☆263May 22, 2022Updated 3 years ago
- Common APIs and libraries shared by other Kubeflow operator repositories.☆53May 28, 2023Updated 2 years ago
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.☆204Mar 24, 2022Updated 3 years ago
- Katalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This is t…☆540Feb 9, 2026Updated last week
- Fault-tolerant for DL frameworks☆70Jul 5, 2023Updated 2 years ago
- Cloud Native Machine Learning Model Registry☆82Jan 12, 2023Updated 3 years ago
- Device plugins for Volcano, e.g. GPU☆133Mar 20, 2025Updated 10 months ago
- OpenYurt - Extending your native Kubernetes to edge(project under CNCF)☆1,939Feb 9, 2026Updated last week
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆34Nov 11, 2023Updated 2 years ago
- Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration☆5,290Updated this week
- Apache YuniKorn Core☆1,002Feb 4, 2026Updated last week
- JuiceFS CSI Driver☆285Feb 8, 2026Updated last week
- GPU plugin to the node feature discovery for Kubernetes☆307May 27, 2024Updated last year
- ☆123Nov 1, 2022Updated 3 years ago
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆32May 19, 2023Updated 2 years ago
- ☆335Updated this week
- Automated Machine Learning on Kubernetes☆1,656Updated this week
- Set of Kubernetes solutions for reusing idle resources of nodes by running extra batch jobs☆357Jul 7, 2025Updated 7 months ago
- RDMA CNI plugin for containerized workloads☆58Feb 3, 2026Updated last week
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.☆19May 30, 2025Updated 8 months ago