sql-machine-learning / elasticdl
Kubernetes-native Deep Learning Framework
☆739Updated last year
Alternatives and similar repositories for elasticdl:
Users that are interested in elasticdl are comparing it to the libraries listed below
- Cloud Native ML/DL Platform☆133Updated 4 years ago
- A CLI for Kubeflow.☆763Updated this week
- Elastic Deep Learning for deep learning framework on Kubernetes☆172Updated last year
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆267Updated 2 years ago
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆517Updated last year
- Fault-tolerant for DL frameworks☆70Updated last year
- ☆323Updated last week
- A stand alone industrial serving system for angel.☆62Updated 2 years ago
- ElasticCTR,即飞桨弹性计算推荐系统,是基于Kubernetes的企业级推荐系统开源解决方案。该方案融合了百度业务场景下持续打磨的高精度CTR模型、飞桨开源框架的大规模分布式训练能力、工业级稀疏参数弹性调度服务,帮助用户在Kubernetes环境中一键完成推荐系统部…☆181Updated 4 years ago
- ☆372Updated 7 years ago
- A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster☆157Updated 11 months ago
- ☆214Updated last year
- deepx_core是一个专注于张量计算/深度学习的基础库☆375Updated last year
- A CLI for Kubeflow.☆60Updated last year
- ☆205Updated last year
- Lightweight and Scalable framework that combines mainstream algorithms of Click-Through-Rate prediction based computational DAG, philosop…☆674Updated 5 years ago
- High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.☆533Updated 2 years ago
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training☆982Updated last week
- DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foun…☆1,087Updated 2 months ago
- General-Purpose Kubernetes Pod Controller☆175Updated 2 years ago
- Bagua Speeds up PyTorch☆879Updated 8 months ago
- FastNN provides distributed training examples that use EPL.☆83Updated 3 years ago
- A flexible, high-performance serving system for machine learning models☆142Updated 3 years ago
- Read and write Tensorflow TFRecord data from Apache Spark.☆292Updated 11 months ago
- Common APIs and libraries shared by other Kubeflow operator repositories.☆52Updated last year
- High performance distributed framework for training deep learning recommendation models based on PyTorch.☆401Updated this week
- A high performance and generic framework for distributed DNN training☆3,671Updated last year
- DeepLearning Framework Performance Profiling Toolkit☆285Updated 3 years ago
- Docker for Your ML/DL Models Based on OCI Artifacts☆467Updated last year
- OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development…☆97Updated 3 years ago