k8sp / tutorials
☆16Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for tutorials
- Fault-tolerant for DL frameworks☆69Updated last year
- A Kubernetes operator for mxnet jobs☆53Updated 2 years ago
- ☆62Updated 7 years ago
- SOTA benchmark☆17Updated last year
- RDMA device plugin for Kubernetes☆204Updated 11 months ago
- Elastic Deep Learning for deep learning framework on Kubernetes☆171Updated last year
- Kubernetes Rdma SRIOV device plugin☆109Updated 3 years ago
- ☆42Updated 8 years ago
- The sample code of running TensorFlow in Kubernetes☆23Updated 4 years ago
- Common APIs and libraries shared by other Kubeflow operator repositories.☆51Updated last year
- ☆55Updated 4 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆84Updated 3 months ago
- Experimental repository for a caffe2 operator☆16Updated 2 years ago
- A CLI for Kubeflow.☆59Updated 10 months ago
- ☆123Updated 3 years ago
- ☆56Updated 5 years ago
- ☆30Updated last year
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆263Updated last year
- Cloud Native ML/DL Platform☆128Updated 4 years ago
- ☆14Updated 4 years ago
- A kubernetes plugin which enables dynamically add or remove GPU resources for a running Pod☆120Updated 2 years ago
- Some OpenMP like syntax for Go☆23Updated 10 years ago
- alibabacloud-aiacc-demo☆42Updated last year
- Some tensorflow examples☆19Updated 6 years ago
- ☆209Updated last year
- High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.☆532Updated 2 years ago
- GPU-specialized parameter server for GPU machine learning.☆100Updated 6 years ago
- Simple Dynamic Batching Inference☆145Updated 2 years ago