anyscale / airflow-provider-ray
Ray provider for Apache Airflow
☆48Updated last year
Alternatives and similar repositories for airflow-provider-ray:
Users that are interested in airflow-provider-ray are comparing it to the libraries listed below
- Ray-based Apache Beam runner☆42Updated last year
- ☆30Updated 3 years ago
- Distributed XGBoost on Ray☆148Updated 10 months ago
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆207Updated this week
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated last month
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.☆50Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- Flyte Documentation 📖☆81Updated last month
- Native Kubernetes integration for Dask☆322Updated 2 weeks ago
- Magniv Core - A Python-decorator based job orchestration platform. Avoid responsibility handoffs by abstracting infra and DevOps.☆79Updated 9 months ago
- Serverless Python with Ray☆55Updated 2 years ago
- MLOps Python Library☆118Updated 3 years ago
- Stream Processing Made Easy☆41Updated 3 years ago
- Unified Distributed Execution☆52Updated 6 months ago
- Distributed SQL Engine in Python using Dask☆404Updated 8 months ago
- Extensible Python SDK for developing Flyte tasks and workflows. Simple to get started and learn and highly extensible.☆273Updated this week
- Machine Learning Projects with Flytekit☆36Updated last year
- Docker images for dask☆240Updated last week
- Metadata tracking and UI service for Metaflow!☆201Updated 3 weeks ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆45Updated 5 months ago
- A multi-tenant server for securely deploying and managing Dask clusters.☆141Updated 3 weeks ago
- Pylint plugin for static code analysis on Airflow code☆94Updated 4 years ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆301Updated 10 months ago
- Deploy dask on YARN clusters☆69Updated 8 months ago
- ☆58Updated last year
- Using MLflow with a PostgreSQL Database Tracking URI and a Minio Artifact URI, and MLflow Registry☆12Updated 4 years ago
- Projects developed by Domino's R&D team☆76Updated 3 years ago
- Distribution transparent Machine Learning experiments on Apache Spark☆90Updated last year
- Python client for RedisAI☆89Updated last year
- This library can convert a pydantic class to a avro schema or generate python code from a avro schema.☆72Updated 3 weeks ago