A Kubernetes operator for mxnet jobs
☆52Dec 1, 2021Updated 4 years ago
Alternatives and similar repositories for mxnet-operator
Users that are interested in mxnet-operator are comparing it to the libraries listed below
Sorting:
- Common APIs and libraries shared by other Kubeflow operator repositories.☆53May 28, 2023Updated 2 years ago
- Studying GPU Multi-tenancy☆11Jan 11, 2019Updated 7 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆91Jan 10, 2025Updated last year
- Tools for ML/MXNet on Kubernetes.☆44Feb 11, 2018Updated 8 years ago
- Experimental repository for a caffe2 operator☆16Dec 1, 2021Updated 4 years ago
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- Kernel for Kubeflow in Jupyter Notebook☆65Aug 13, 2019Updated 6 years ago
- ☆32Jun 15, 2021Updated 4 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆516Updated this week
- Dynamic training with Apache MXNet reduces cost and time for training deep neural networks by leveraging AWS cloud elasticity and scale. …☆56Nov 25, 2022Updated 3 years ago
- Batch-scheduler based on K8s scheduling framework, related features have contributed to scheduler-plugins(Deprecated).☆25Aug 6, 2020Updated 5 years ago
- PyTorch on Kubernetes☆309Dec 1, 2021Updated 4 years ago
- Simulated large clusters for Kubernetes scheduler validation.☆15Jan 3, 2023Updated 3 years ago
- A simple tool for parsing the profile.json file of mxnet☆14Aug 1, 2018Updated 7 years ago
- ☆10Jul 29, 2020Updated 5 years ago
- Seldon Core Operator for Kubernetes☆13Nov 5, 2019Updated 6 years ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- Information about the Kubeflow community including proposals and governance information.☆183Feb 5, 2026Updated 3 weeks ago
- ☆132Apr 19, 2021Updated 4 years ago
- A fast & easy way to train ML models in your cloud, directly from your laptop.☆14Mar 28, 2022Updated 3 years ago
- A collection of common util libraries for Go☆25Oct 25, 2020Updated 5 years ago
- Incubating project for xgboost operator☆76Dec 1, 2021Updated 4 years ago
- 👩🔬[Experimental] Easily train and serve ML models on Kubernetes, directly from your python code.☆31Nov 8, 2018Updated 7 years ago
- the hadoop plugin for chdfs☆14Updated this week
- Automatic tuning for ML model deployment on Kubernetes☆80Nov 1, 2024Updated last year
- A kubernetes plugin which enables dynamically add or remove GPU resources for a running Pod☆127Feb 23, 2022Updated 4 years ago
- Model factory is a ML training platform to help engineers to build ML models at scale☆17Sep 27, 2021Updated 4 years ago
- The scheduler of Volcano, built based on kubernetes-sigs/kube-batch☆14Jul 7, 2019Updated 6 years ago
- [WIP] Open Source WakaTime Server☆14Feb 4, 2019Updated 7 years ago
- Volume Controller for Kubernetes☆67Jan 3, 2023Updated 3 years ago
- 碩士論文文獻筆記(Deep Learning、Scheduling、Distributed、Kubernetes)☆51May 5, 2019Updated 6 years ago
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,035Updated this week
- Logging MXNet data for visualization in TensorBoard.☆324Nov 30, 2021Updated 4 years ago
- Machine Learning Inference Graph Spec☆21Jul 27, 2019Updated 6 years ago
- Full instructions for repainting the past☆19May 30, 2019Updated 6 years ago
- GluonNLP tutorial for Pycon2019☆14Aug 16, 2019Updated 6 years ago
- GPU analyzer for Kubernetes GPU clusters☆17Apr 11, 2020Updated 5 years ago
- ☆37May 19, 2019Updated 6 years ago
- GPU Sharing Device Plugin for Kubernetes Cluster☆492Jan 10, 2023Updated 3 years ago