anyscale / airflow-provider-ray
Ray provider for Apache Airflow
☆47Updated last year
Alternatives and similar repositories for airflow-provider-ray
Users who are interested in airflow-provider-ray are comparing it to the libraries listed below
- Distributed XGBoost on Ray☆152Updated last year
- Ray-based Apache Beam runner☆42Updated 2 years ago
- A portable Multimodal Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to you…☆261Updated this week
- ☆31Updated 4 years ago
- Native Kubernetes integration for Dask☆324Updated 2 months ago
- Flyte Documentation 📖☆84Updated 8 months ago
- Extensible Python SDK for developing Flyte tasks and workflows. Simple to get started and learn and highly extensible.☆300Updated this week
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas-like DataFrame.☆301Updated last year
- Apache Avro <-> pandas DataFrame☆138Updated 3 months ago
- Distribution transparent Machine Learning experiments on Apache Spark☆91Updated last year
- RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.☆355Updated this week
- Docker images for dask☆244Updated last week
- ByteHub: making feature stores simple☆61Updated 4 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more...☆145Updated 2 months ago
- MLOps Python Library☆120Updated 3 years ago
- Deploy dask on YARN clusters☆69Updated last year
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated 9 months ago
- Unified specification for defining and executing ML workflows, making reproducibility, consistency, and governance easier across the ML p…☆94Updated last year
- FlorDB 🌻☆156Updated 2 months ago
- Magniv Core - A Python-decorator based job orchestration platform. Avoid responsibility handoffs by abstracting infra and DevOps.☆81Updated last year
- A JupyterLab extension for browsing S3-compatible object storage☆132Updated 3 months ago
- Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.☆51Updated 3 years ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆45Updated this week
- Distributed SQL Engine in Python using Dask☆409Updated last year
- yogadl, the flexible data layer☆74Updated 2 years ago
- Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆145Updated last year
- Concept drift monitoring for HA model servers.☆101Updated 2 years ago
- The Prefect API and backend☆248Updated 2 years ago
- Joblib Apache Spark Backend☆249Updated 8 months ago
- Code examples showing flow deployment to various types of infrastructure☆111Updated 2 years ago