elasticdeeplearning / edl
Elastic Deep Learning for deep learning framework on Kubernetes
☆170Updated last year
Related projects: ⓘ
- Fault-tolerant for DL frameworks☆68Updated last year
- Common APIs and libraries shared by other Kubeflow operator repositories.☆51Updated last year
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆31Updated last year
- A Kubernetes operator for mxnet jobs☆53Updated 2 years ago
- Kubernetes Scheduler for Deep Learning☆252Updated 2 years ago
- General-Purpose Kubernetes Pod Controller☆173Updated last year
- Automatic tuning for ML model deployment on Kubernetes☆78Updated 4 months ago
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆260Updated last year
- Kubernetes Operator for AI and Bigdata Elastic Training☆84Updated last month
- ☆11Updated this week
- Cloud Native ML/DL Platform☆127Updated 4 years ago
- ☆53Updated 4 years ago
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆498Updated 6 months ago
- The DGL Operator makes it easy to run Deep Graph Library (DGL) graph neural network training on Kubernetes☆45Updated 3 years ago
- A kubernetes plugin which enables dynamically add or remove GPU resources for a running Pod☆118Updated 2 years ago
- PyTorch on Kubernetes☆305Updated 2 years ago
- Cloud Native Machine Learning Model Registry☆80Updated last year
- Elastic Serverless Serving based on Kubernetes, provides 0 instance serving capability.☆10Updated 2 years ago
- GPU-scheduler-for-deep-learning☆192Updated 3 years ago
- ☆50Updated 11 months ago
- Repository for benchmarking☆77Updated 3 months ago
- Incubating project for xgboost operator☆76Updated 2 years ago
- Kernel for Kubeflow in Jupyter Notebook☆67Updated 5 years ago
- alibabacloud-aiacc-demo☆42Updated last year
- Kubernetes-native Deep Learning Framework☆732Updated 7 months ago
- Resource-adaptive cluster scheduler for deep learning training.☆422Updated last year
- ☆126Updated 3 years ago
- ☆205Updated last year
- DeepLearning Framework Performance Profiling Toolkit☆276Updated 2 years ago
- Device plugins for Volcano, e.g. GPU☆98Updated this week