Netflix / metaflow-extensions-template
☆15 Updated last year
Related projects
Alternatives and complementary repositories for metaflow-extensions-template
- ☆22 Updated last month
- Tools and utilities for operating Metaflow in production ☆48 Updated 2 months ago
- Deploy production-grade Metaflow cloud infrastructure on AWS ☆58 Updated 2 months ago
- A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture t… ☆159 Updated this week
- Ray provider for Apache Airflow ☆47 Updated 9 months ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more... ☆134 Updated 3 weeks ago
- Metadata tracking and UI service for Metaflow! ☆192 Updated this week
- Unity Catalog UI ☆39 Updated 2 months ago
- [Project moved] Polars integration for Dagster ☆37 Updated 7 months ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆42 Updated 9 months ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows. ☆80 Updated 5 months ago
- A data modelling layer built on top of polars and pydantic ☆197 Updated last year
- Ray-based Apache Beam runner ☆42 Updated last year
- fsspec-compatible Azure Datalake and Azure Blob Storage access ☆178 Updated 2 months ago
- ☆54 Updated 10 months ago
- Pandas ExtensionDType/Array backed by Apache Arrow ☆229 Updated last year
- An fsspec implementation for the lakeFS project ☆39 Updated last week
- Slack bot for monitoring your Metaflow flows! ☆27 Updated 3 years ago
- IbisML is a library for building scalable ML pipelines using Ibis. ☆93 Updated last month
- Native Kubernetes integration for Dask ☆311 Updated this week
- Dask integration for Snowflake ☆30 Updated 4 months ago
- ✨ A Pydantic to PySpark schema library ☆55 Updated this week
- RFC document, tooling and other content related to the dataframe API standard ☆102 Updated 7 months ago
- Terraform Provider for Prefect Cloud ☆33 Updated this week
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples ☆50 Updated this week
- pytest plugin to run the tests with support of pyspark ☆85 Updated 8 months ago
- A data modelling layer built on top of polars and pydantic ☆309 Updated 2 weeks ago
- A playground for running duckdb as a stateless query engine over a data lake ☆168 Updated 9 months ago
- Pythonic Iceberg REST Catalog ☆65 Updated last month
- Kedro Plugin to support running pipelines on Kubernetes using Airflow. ☆29 Updated last year