Common APIs and libraries shared by other Kubeflow operator repositories.
☆53 · May 28, 2023 · Updated 2 years ago
Alternatives and similar repositories for common
Users who are interested in common are comparing it to the repositories listed below.
- Incubating project for xgboost operator · ☆77 · Dec 1, 2021 · Updated 4 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.) · ☆519 · Updated this week
- A Kubernetes operator for mxnet jobs · ☆52 · Dec 1, 2021 · Updated 4 years ago
- The scheduler of Volcano, built based on kubernetes-sigs/kube-batch · ☆14 · Jul 7, 2019 · Updated 6 years ago
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling. · ☆19 · May 30, 2025 · Updated 9 months ago
- Fault-tolerant for DL frameworks · ☆70 · Jul 5, 2023 · Updated 2 years ago
- Run your deep learning workloads on Kubernetes more easily and efficiently. · ☆531 · Mar 4, 2024 · Updated 2 years ago
- Simulated large clusters for Kubernetes scheduler validation. · ☆15 · Jan 3, 2023 · Updated 3 years ago
- JPMML-SparkML plugin for converting LightGBM-Spark models to PMML · ☆43 · Oct 23, 2021 · Updated 4 years ago
- Automate Scaffolding R Interfaces to Packages in Other Programming Languages · ☆28 · Jul 7, 2023 · Updated 2 years ago
- Personal Blog · ☆12 · Mar 14, 2026 · Updated last week
- Automatic tuning for ML model deployment on Kubernetes · ☆80 · Nov 1, 2024 · Updated last year
- benchmark-for-spark · ☆18 · May 7, 2025 · Updated 10 months ago
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes · ☆2,056 · Updated this week
- Backend server for envd · ☆21 · Dec 18, 2023 · Updated 2 years ago
- GPU analyzer for Kubernetes GPU clusters · ☆17 · Apr 11, 2020 · Updated 5 years ago
- NVIDIA device plugin for Kubernetes · ☆15 · Sep 9, 2019 · Updated 6 years ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous. · ☆17 · Aug 4, 2022 · Updated 3 years ago
- Test infrastructure and tooling for Kubeflow. · ☆62 · Feb 14, 2025 · Updated last year
- CRAN Task View: Model Deployment with R · ☆20 · Aug 24, 2022 · Updated 3 years ago
- [WIP] Open Source WakaTime Server · ☆14 · Feb 4, 2019 · Updated 7 years ago
- Repository used to maintain group ACLs used by Kubeflow developers · ☆18 · Updated this week
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC · ☆1,092 · May 22, 2023 · Updated 2 years ago
- Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Ap… · ☆943 · Oct 8, 2024 · Updated last year
- ☆31 · Jun 15, 2021 · Updated 4 years ago
- Product roadmap for Alibaba Cloud Container Services including ACK, ACR, ASK - Serverless K8S, ACK@Edge and ASM - Service Mesh · ☆33 · Nov 15, 2021 · Updated 4 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training · ☆91 · Jan 10, 2025 · Updated last year
- Injector trait as a webhook to inject data into Workload. · ☆15 · Apr 14, 2021 · Updated 4 years ago
- Lightning-fast data access platform designed specifically for AI agents · ☆30 · Feb 24, 2026 · Updated 3 weeks ago
- Information about the Kubeflow community including proposals and governance information. · ☆188 · Mar 4, 2026 · Updated 2 weeks ago
- Kubernetes operator for Bagua distributed training job. · ☆13 · Feb 7, 2023 · Updated 3 years ago
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively. · ☆201 · Mar 24, 2022 · Updated 3 years ago
- Batch-scheduler based on K8s scheduling framework, related features have contributed to scheduler-plugins(Deprecated). · ☆25 · Aug 6, 2020 · Updated 5 years ago
- Kubernetes-native Deep Learning Framework · ☆744 · Jan 26, 2024 · Updated 2 years ago
- Kubernetes Rdma SRIOV device plugin · ☆113 · Dec 30, 2020 · Updated 5 years ago
- A CLI for Kubeflow. · ☆809 · Updated this week
- Docker for Your ML/DL Models Based on OCI Artifacts · ☆472 · Jan 26, 2024 · Updated 2 years ago
- Repository for out-of-tree scheduler plugins based on scheduler framework. · ☆1,281 · Updated this week
- ☆123 · Nov 1, 2022 · Updated 3 years ago