4paradigm / openaios-platform
OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development platform built upon OpenAIOS for enterprises to develop and deploy AI applications for production.
☆93Updated 3 years ago
Related projects: ⓘ
- Cloud Native ML/DL Platform☆127Updated 4 years ago
- A CLI for Kubeflow.☆58Updated 8 months ago
- kubeflow国内一键安装文件☆338Updated 2 years ago
- ☆112Updated last month
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆260Updated last year
- ☆32Updated 3 years ago
- A flexible, high-performance serving system for machine learning models☆138Updated 2 years ago
- Cloud Native Machine Learning Model Registry☆80Updated last year
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆498Updated 6 months ago
- The DGL Operator makes it easy to run Deep Graph Library (DGL) graph neural network training on Kubernetes☆45Updated 3 years ago
- ☆193Updated last year
- demo applications that show how to deploy offline feature engineering solutions to online in one minute with fedb and nativespark☆35Updated last year
- A kubernetes plugin which enables dynamically add or remove GPU resources for a running Pod☆118Updated 2 years ago
- Fault-tolerant for DL frameworks☆68Updated last year
- ElasticCTR,即飞桨弹性计算推荐系统,是基于Kubernetes的企业级推荐系统开源解决方案。该方案融合了百度业务场景下持续打磨的高精度CTR模型、飞桨开源框架的大规模分布式训练能力、工业级稀疏参数弹性调度服务,帮助用户在Kubernetes环境中一键完成推荐系统部…☆180Updated 4 years ago
- Kubeflow helm chart☆136Updated last year
- 一种任务级GPU算力分时调度的高性能深度学习训练平台☆291Updated 10 months ago
- OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.☆30Updated last year
- OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow app…☆489Updated 3 months ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆84Updated last month
- Serving AI/ML models in the open standard formats PMML and ONNX with both HTTP (REST API) and gRPC endpoints☆144Updated last week
- FastNN provides distributed training examples that use EPL.☆81Updated 2 years ago
- A stand alone industrial serving system for angel.☆62Updated 2 years ago
- ☆18Updated last year
- Elastic Deep Learning for deep learning framework on Kubernetes☆170Updated last year
- ☆266Updated last year
- Automatic tuning for ML model deployment on Kubernetes☆78Updated 4 months ago
- Pafka is originated from the OpenAIOS project to leverage an optimized tiered storage access strategy to improve overall performance for …☆68Updated 2 years ago
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.☆193Updated 2 years ago
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆31Updated last year