oap-project / cloudtikLinks
Cloud Scale Platform for Distributed Analytics and AI
☆24Updated last year
Alternatives and similar repositories for cloudtik
Users that are interested in cloudtik are comparing it to the libraries listed below
Sorting:
- Boston University Collaboratory project for applying AI to cloud operations☆12Updated 2 years ago
- ☆15Updated 3 weeks ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆12Updated last year
- Cloud-based AI / ML workflow and data application development framework☆17Updated last year
- A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer☆51Updated last year
- Pafka is originated from the OpenAIOS project to leverage an optimized tiered storage access strategy to improve overall performance for …☆67Updated 3 years ago
- Holistic job manager on Kubernetes☆116Updated last year
- A tool to detect infrastructure issues on cloud native AI systems☆47Updated 3 weeks ago
- Apache Yunikorn website - see the master branch for instructions☆29Updated 3 weeks ago
- Apache YuniKorn Web UI☆36Updated 3 weeks ago
- Apache YuniKorn Release☆43Updated last month
- Apache YuniKorn K8shim☆158Updated 3 weeks ago
- cricket is a virtualization solution for GPUs☆215Updated last month
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆82Updated last week
- A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.☆29Updated last week
- This is archive of SparkRDMA project. The new repository with RDMA shuffle acceleration for Apache Spark is here: https://github.com/Nvid…☆252Updated 6 years ago
- KNoC is a Kubernetes Virtual Kubelet that uses an HPC cluster as the container execution environment☆21Updated 2 years ago
- Spark Shuffle Optimization with RDMA+AEP☆30Updated 2 years ago
- NVIDIA GPUDirect Storage Driver☆285Updated last month
- General-Purpose Kubernetes Pod Controller☆175Updated 2 years ago
- Mobius is an AI infrastructure platform for distributed online learning, including online sample processing, training and serving.☆100Updated last year
- A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project)☆215Updated 2 months ago
- A modular acceleration toolkit for big data analytic engines☆67Updated last year
- Apache YuniKorn Scheduler Interface☆32Updated last month
- DAOS Storage Stack (client libraries, storage engine, control plane)☆878Updated this week
- A validation and profiling tool for AI infrastructure☆338Updated this week
- Persistent Memory Container Storage Interface Driver☆163Updated 11 months ago
- MLPerf® Storage Benchmark Suite☆162Updated last week
- RDMA-enabled Apache Kafka☆20Updated 3 years ago
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆35Updated last year