A Kubernetes operator for mxnet jobs
☆52Dec 1, 2021Updated 4 years ago
Alternatives and similar repositories for mxnet-operator
Users that are interested in mxnet-operator are comparing it to the libraries listed below
Sorting:
- Common APIs and libraries shared by other Kubeflow operator repositories.☆53May 28, 2023Updated 2 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆91Jan 10, 2025Updated last year
- Studying GPU Multi-tenancy☆11Jan 11, 2019Updated 7 years ago
- Tools for ML/MXNet on Kubernetes.☆44Feb 11, 2018Updated 8 years ago
- Experimental repository for a caffe2 operator☆16Dec 1, 2021Updated 4 years ago
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- Kernel for Kubeflow in Jupyter Notebook☆65Aug 13, 2019Updated 6 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆519Updated this week
- [WIP] Open Source WakaTime Server☆14Feb 4, 2019Updated 7 years ago
- A simple tool for parsing the profile.json file of mxnet☆14Aug 1, 2018Updated 7 years ago
- PyTorch on Kubernetes☆309Dec 1, 2021Updated 4 years ago
- A kubernetes plugin which enables dynamically add or remove GPU resources for a running Pod☆127Feb 23, 2022Updated 4 years ago
- ☆11May 22, 2017Updated 8 years ago
- GluonNLP tutorial for Pycon2019☆14Aug 16, 2019Updated 6 years ago
- Experimental flow-based Kubernetes scheduler☆34Jan 4, 2018Updated 8 years ago
- ☆31Jun 15, 2021Updated 4 years ago
- Logging MXNet data for visualization in TensorBoard.☆324Nov 30, 2021Updated 4 years ago
- ☆131Apr 19, 2021Updated 4 years ago
- Distributed AI Model Training and LLM Fine-Tuning on Kubernetes☆2,056Updated this week
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC☆1,092May 22, 2023Updated 2 years ago
- High performance NCCL plugin for Bagua.☆15Sep 15, 2021Updated 4 years ago
- 👩🔬[Experimental] Easily train and serve ML models on Kubernetes, directly from your python code.☆31Nov 8, 2018Updated 7 years ago
- Information about the Kubeflow community including proposals and governance information.☆188Mar 4, 2026Updated 2 weeks ago
- Batch-scheduler based on K8s scheduling framework, related features have contributed to scheduler-plugins(Deprecated).☆25Aug 6, 2020Updated 5 years ago
- A collection of common util libraries for Go☆25Oct 25, 2020Updated 5 years ago
- Volume Controller for Kubernetes☆67Jan 3, 2023Updated 3 years ago
- 碩士論文文獻筆記(Deep Learning、Scheduling、Distributed、Kubernetes)☆51May 5, 2019Updated 6 years ago
- Implemention of Capsule Net from the paper Dynamic Routing Between Capsules☆24Nov 12, 2017Updated 8 years ago
- Automatic tuning for ML model deployment on Kubernetes☆80Nov 1, 2024Updated last year
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- Incubating project for xgboost operator☆77Dec 1, 2021Updated 4 years ago
- The DayTrader 3 benchmark sample, which is a Java EE 6 application built around the paradigm of an online stock trading system.☆11Nov 18, 2019Updated 6 years ago
- ☆123Nov 1, 2022Updated 3 years ago
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆122Dec 8, 2025Updated 3 months ago
- benchmark-for-spark☆18May 7, 2025Updated 10 months ago
- ☆23Mar 8, 2016Updated 10 years ago
- A fast & easy way to train ML models in your cloud, directly from your laptop.☆14Mar 28, 2022Updated 3 years ago
- ☆10Jul 29, 2020Updated 5 years ago
- Resource-adaptive cluster scheduler for deep learning training.☆453Mar 5, 2023Updated 3 years ago