Netflix / metaflow-extensions-template
☆15Updated last year
Alternatives and similar repositories for metaflow-extensions-template:
Users that are interested in metaflow-extensions-template are comparing it to the libraries listed below
- Deploy production-grade Metaflow cloud infrastructure on AWS☆63Updated 2 months ago
- ☆22Updated 2 months ago
- A web API for dbt.☆109Updated 3 months ago
- Fast iterative local development and testing of Apache Airflow workflows☆197Updated 2 months ago
- Metadata tracking and UI service for Metaflow!☆196Updated this week
- Tools and utilities for operating Metaflow in production☆52Updated this week
- Ray provider for Apache Airflow☆47Updated last year
- Use pyarrow with Azure Data Lake gen2☆26Updated 8 months ago
- A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs☆41Updated 10 months ago
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer☆146Updated this week
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆139Updated last month
- Ray integration for Dagster☆36Updated this week
- dbt adapter for Athena☆38Updated 9 months ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- Utility functions for dbt projects running on Spark☆31Updated last month
- This is the main repository for SDF documentation found at docs.sdf.com, as well as public schemas, benchmarks, and examples☆116Updated last month
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated 10 months ago
- Unity Catalog UI☆39Updated 6 months ago
- pytest plugin to run the tests with support of pyspark☆85Updated last year
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆76Updated this week
- Ray-based Apache Beam runner☆43Updated last year
- ✨ A Pydantic to PySpark schema library☆72Updated this week
- ☆55Updated last year
- Define, govern, and model event data for warehouse-first product analytics.☆82Updated 8 months ago
- ☆19Updated 3 months ago
- Redshift Python Connector. It supports Python Database API Specification v2.0.☆209Updated 2 months ago
- Deployment tools/scripts for Metaflow!☆56Updated last year
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆43Updated this week
- re_data - fix data issues before your users & CEO would discover them 😊☆99Updated 10 months ago