kubeflow / pytorch-operator
PyTorch on Kubernetes
☆307 Updated 2 years ago
Related projects
Alternatives and complementary repositories for pytorch-operator
- Python SDK for building, training, and deploying ML models ☆337 Updated 2 years ago
- A CLI for Kubeflow. ☆740 Updated this week
- Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times ☆88 Updated 2 years ago
- Elastic Deep Learning for deep learning framework on Kubernetes ☆171 Updated last year
- Incubating project for xgboost operator ☆76 Updated 2 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.) ☆440 Updated last month
- PyTorch elastic training ☆730 Updated 2 years ago
- Information about the Kubeflow community including proposals and governance information. ☆159 Updated 2 weeks ago
- Repository for assets related to Metadata. ☆121 Updated 2 years ago
- General-Purpose Kubernetes Pod Controller ☆174 Updated last year
- Common APIs and libraries shared by other Kubeflow operator repositories. ☆51 Updated last year
- Fault-tolerant for DL frameworks ☆69 Updated last year
- Kernel for Kubeflow in Jupyter Notebook ☆67 Updated 5 years ago
- Collection of tools and examples for managing Accelerated workloads in Kubernetes Engine ☆215 Updated 2 weeks ago
- A Kubernetes operator for mxnet jobs ☆53 Updated 2 years ago
- Example for end-to-end machine learning on Kubernetes using Kubeflow and Seldon Core ☆172 Updated 2 years ago
- Run your deep learning workloads on Kubernetes more easily and efficiently. ☆510 Updated 8 months ago
- Docker for Your ML/DL Models Based on OCI Artifacts ☆461 Updated 9 months ago
- Repository for benchmarking ☆77 Updated 5 months ago
- Kubernetes Scheduler for Deep Learning ☆255 Updated 2 years ago
- GPU Sharing Device Plugin for Kubernetes Cluster ☆471 Updated last year
- Automated Machine Learning on Kubernetes ☆1,510 Updated 2 weeks ago
- 👩‍🔬 Train and Serve TensorFlow Models at Scale with Kubernetes and Kubeflow on Azure ☆289 Updated 4 years ago
- GPU plugin to the node feature discovery for Kubernetes ☆293 Updated 5 months ago
- ☆48 Updated 6 years ago
- Distributed ML Training and Fine-Tuning on Kubernetes ☆1,613 Updated this week
- Controller for ModelMesh ☆205 Updated 4 months ago
- Kubernetes Operator for AI and Bigdata Elastic Training ☆84 Updated 3 months ago