OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development platform built upon OpenAIOS for enterprises to develop and deploy AI applications for production.
☆99Aug 20, 2021Updated 4 years ago
Alternatives and similar repositories for openaios-platform
Users that are interested in openaios-platform are comparing it to the libraries listed below
Sorting:
- OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow app…☆584May 21, 2024Updated last year
- OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and infe…☆1,681Updated this week
- Elastic Deep Learning for deep learning framework on Kubernetes☆175Jul 5, 2023Updated 2 years ago
- OpenEmbedding is an open source framework for Tensorflow distributed training acceleration.☆33Apr 13, 2023Updated 2 years ago
- demo applications that show how to deploy offline feature engineering solutions to online in one minute with fedb and nativespark☆35Oct 15, 2024Updated last year
- 各种深度学习(DL)框架分布式训练,包括:Tensorflow、Tensorflow2、Pytorch、Chainer、Caffe、Mxnet ...☆22Aug 8, 2020Updated 5 years ago
- ☆10Jul 29, 2020Updated 5 years ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- A fast & easy way to train ML models in your cloud, directly from your laptop.☆14Mar 28, 2022Updated 3 years ago
- The DGL Operator makes it easy to run Deep Graph Library (DGL) graph neural network training on Kubernetes☆44Sep 15, 2021Updated 4 years ago
- A Kubernetes operator for mxnet jobs☆52Dec 1, 2021Updated 4 years ago
- New Repo: https://github.com/byzer-org/kolo-lang☆12Dec 16, 2021Updated 4 years ago
- the hadoop plugin for chdfs☆14Feb 27, 2026Updated last week
- Kubernetes Operator for AI and Bigdata Elastic Training☆91Jan 10, 2025Updated last year
- Product roadmap for Alibaba Cloud Container Services including ACK, ACR, ASK - Serverless K8S, ACK@Edge and ASM - Service Mesh☆33Nov 15, 2021Updated 4 years ago
- A collection of example for learning how to use Golang.☆14May 4, 2019Updated 6 years ago
- GPU analyzer for Kubernetes GPU clusters☆17Apr 11, 2020Updated 5 years ago
- Repository for chainer operator☆17Nov 14, 2021Updated 4 years ago
- NebulaGraph DGL(Deep Graph Library) Integration Package. (WIP)☆38Mar 14, 2024Updated last year
- Helpers for dealing with python.subprocess.Popen and paramiko.☆18Updated this week
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- benchmark-for-spark☆18May 7, 2025Updated 10 months ago
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆271Mar 31, 2023Updated 2 years ago
- This repository contains statistics about the AI Infrastructure products.☆17Feb 27, 2025Updated last year
- alibabacloud-aiacc-demo☆43May 4, 2023Updated 2 years ago
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Jan 5, 2023Updated 3 years ago
- SCV is a distributed cluster GPU sniffer. SCV是一个分布式GPU嗅探器☆20Feb 25, 2023Updated 3 years ago
- 滴滴云推理服务的 HTTP 客户端示例代码☆21Nov 21, 2022Updated 3 years ago
- 非沪籍高校毕业生留沪各项流程汇总☆17Jan 24, 2018Updated 8 years ago
- Face presentation attack detection (aka face anti-spoofing, face spoofing detection or face liveness detection) using guided scale textur…☆17Apr 5, 2021Updated 4 years ago
- AutoX is an efficient automl tool, which is mainly aimed at data mining tasks with tabular data.☆549Feb 14, 2023Updated 3 years ago
- High performance NCCL plugin for Bagua.☆15Sep 15, 2021Updated 4 years ago
- Lightning Fast: Faiss CPU + Onnx Quantized Multilingual Embedding Model☆23Sep 13, 2024Updated last year
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆531Mar 4, 2024Updated 2 years ago
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.☆202Mar 24, 2022Updated 3 years ago
- QDrant docker-compose deployment with basic auth/nginx proxy☆23Apr 12, 2023Updated 2 years ago
- Single Path One-Shot NAS MXNet implementation with Supernet training and searching☆19Dec 23, 2019Updated 6 years ago
- spark自学手册,包含了例如spark core、spark sql、spark streaming、spark-kafka、delta-lake,以及scala基础练习,还有一些例如master、shuffle源码分析,总结及翻译。☆18Jul 19, 2023Updated 2 years ago
- Prophecis is a one-stop cloud native machine learning platform.☆511Mar 28, 2025Updated 11 months ago