microsoft / openpai-runtimeLinks
Runtime for deep learning workload
☆21Updated 3 years ago
Alternatives and similar repositories for openpai-runtime
Users that are interested in openpai-runtime are comparing it to the libraries listed below
Sorting:
- OpenPAI SDK☆19Updated 3 years ago
- Deep Learning Workspace☆204Updated 2 years ago
- A marketplace which stores examples and job templates of openpai. Users could use openpaimarketplace to share their jobs or run-and-learn…☆33Updated 3 years ago
- Extension to connect OpenPAI clusters, submit AI jobs, simulate jobs locally, manage files, and so on.☆15Updated 3 years ago
- General-Purpose Kubernetes Pod Controller☆175Updated 2 years ago
- PyTorch on Kubernetes☆309Updated 4 years ago
- Repo for publishing code Samples and CLI samples for BatchAI service☆126Updated 6 years ago
- Kubernetes Scheduler for Deep Learning☆262Updated 3 years ago
- Fault-tolerant for DL frameworks☆70Updated 2 years ago
- Repository for batch predict☆17Updated 4 years ago
- TensorFlow-nGraph bridge☆136Updated 4 years ago
- Benchmarking Horovod and TF on Batch AI☆26Updated 6 years ago
- MLOS is a project to enable autotuning for systems.☆168Updated 2 weeks ago
- Tutorials on running distributed deep learning on Batch AI☆25Updated 7 years ago
- A Kubernetes operator for mxnet jobs☆52Updated 4 years ago
- 👩🔬 Train and Serve TensorFlow Models at Scale with Kubernetes and Kubeflow on Azure☆289Updated 5 years ago
- Incubating project for xgboost operator☆77Updated 4 years ago
- Repository for assets related to Metadata.☆123Updated 4 years ago
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.☆158Updated 2 months ago
- Code for the neural architecture search methods contained in the paper Efficient Forward Neural Architecture Search☆112Updated 2 years ago
- Azure Machine Learning for Visual Studio Code, previously called Visual Studio Code Tools for AI, is an extension to easily build, train,…☆340Updated 3 months ago
- Elastic Deep Learning for deep learning framework on Kubernetes☆175Updated 2 years ago
- Common APIs and libraries shared by other Kubeflow operator repositories.☆53Updated 2 years ago
- Distributed ML Optimizer☆35Updated 4 years ago
- A multi-user, distributed computing environment for running DL model training experiments on Intel® Xeon® Scalable processor-based system…☆393Updated last year
- [Deprecated] The TensorFlow Profiler (TFProf) UI provides a visual interface for profiling TensorFlow models.☆137Updated 6 years ago
- Fork of NVIDIA device plugin for Kubernetes with support for shared GPUs by declaring GPUs multiple times☆87Updated 3 years ago
- Development repository for integrating FlexFlow (A distributed deep learning framework that supports flexible parallelization strategies)…☆29Updated 4 years ago
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.☆484Updated 2 months ago
- Kernel for Kubeflow in Jupyter Notebook☆65Updated 6 years ago