NVIDIA / ais-etlLinks
Provides for deploying custom ETL containers on AIStore, with subsequent user-defined extraction-transformation-loading in parallel, on the fly and/or offline, locally to user data.
☆16Updated 3 weeks ago
Alternatives and similar repositories for ais-etl
Users that are interested in ais-etl are comparing it to the libraries listed below
Sorting:
- Ray-based Apache Beam runner☆42Updated last year
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.☆157Updated 10 months ago
- A top-like tool for monitoring GPUs in a cluster☆85Updated last year
- A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel app…☆341Updated 2 weeks ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆143Updated last week
- Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p…☆93Updated last year
- 🪴 Nebari - your open source data science platform☆299Updated last week
- ☆30Updated 2 years ago
- Utilities for Dask and CUDA interactions☆311Updated last week
- Synchronicity lets you interoperate with asynchronous Python APIs.☆120Updated 2 weeks ago
- Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.☆521Updated 2 years ago
- Flow with FlorDB 🌻☆154Updated last month
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆230Updated last week
- Extensible Python SDK for developing Flyte tasks and workflows. Simple to get started and learn and highly extensible.☆280Updated last week
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.☆50Updated 2 years ago
- Curated examples and patterns for using Chalk. Use these to build your feature pipelines.☆20Updated last week
- Simplifying the definition and execution, scaling and deployment of pipelines on the cloud.☆233Updated last year
- Chassis turns machine learning models into portable container images that can run just about anywhere.☆86Updated last year
- This repository contains example integrations between Determined and other ML products☆48Updated last year
- Unified storage framework for the entire machine learning lifecycle☆156Updated last year
- A curated list of examples that use Pachyderm to accomplish various tasks.☆63Updated last year
- ☆58Updated last year
- ☆38Updated this week
- JupyterLab extension to provide a Kubeflow specific left area for Notebooks deployment☆18Updated 5 years ago
- Native Kubernetes integration for Dask☆323Updated last week
- dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.☆59Updated 4 years ago
- UnionML: the easiest way to build and deploy machine learning microservices☆335Updated last year
- ☆95Updated 2 years ago
- A multi-tenant server for securely deploying and managing Dask clusters.☆141Updated last week
- Unified Distributed Execution☆54Updated 8 months ago