kubeflow / trainer
Distributed ML Training and Fine-Tuning on Kubernetes
☆1,775Updated this week
Alternatives and similar repositories for trainer:
Users that are interested in trainer are comparing it to the libraries listed below
- Automated Machine Learning on Kubernetes☆1,573Updated last week
- A CLI for Kubeflow.☆769Updated 3 weeks ago
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC☆1,091Updated last year
- Kubeflow Deployment Manifests☆912Updated this week
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆475Updated 3 weeks ago
- PyTorch on Kubernetes☆309Updated 3 years ago
- Machine Learning Pipelines for Kubeflow☆3,805Updated last week
- A Cloud Native Batch System (Project under CNCF)☆4,627Updated this week
- NVIDIA device plugin for Kubernetes☆3,196Updated this week
- Standardized Serverless ML Inference Platform on Kubernetes☆4,142Updated this week
- Machine Learning Toolkit for Kubernetes☆14,927Updated 3 weeks ago
- Information about the Kubeflow community including proposals and governance information.☆173Updated 2 weeks ago
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆521Updated last year
- A toolkit to run Ray applications on Kubernetes☆1,725Updated this week
- GPU Sharing Scheduler for Kubernetes Cluster☆1,463Updated last year
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆644Updated last month
- A CLI-supported framework that streamlines writing and deployment of Kubernetes configurations to multiple clusters.☆1,161Updated 6 years ago
- GPU Sharing Device Plugin for Kubernetes Cluster☆480Updated 2 years ago
- Event-driven application platform for Kubernetes☆1,458Updated last week
- ☆875Updated last year
- Python SDK for building, training, and deploying ML models☆337Updated 3 years ago
- A repository to host extended examples and tutorials☆1,433Updated 3 weeks ago
- Docker for Your ML/DL Models Based on OCI Artifacts☆466Updated last year
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,514Updated this week
- Kubeflow’s superfood for Data Scientists☆633Updated 2 years ago
- GPU plugin to the node feature discovery for Kubernetes☆300Updated 11 months ago
- Kubernetes-native Job Queueing☆1,764Updated this week
- Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Ap…☆936Updated 7 months ago
- General-Purpose Kubernetes Pod Controller☆175Updated 2 years ago
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆1,195Updated last week