Netflix / metaflow-extensions-template
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for metaflow-extensions-template
- Deploy production-grade Metaflow cloud infrastructure on AWS☆58Updated 3 months ago
- ☆22Updated 2 weeks ago
- Tools and utilities for operating Metaflow in production☆47Updated 2 months ago
- Metadata tracking and UI service for Metaflow!☆193Updated this week
- A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture t…☆166Updated 2 weeks ago
- dbt adapter for Athena☆39Updated 5 months ago
- Use pyarrow with Azure Data Lake gen2☆25Updated 4 months ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated 6 months ago
- Utility functions for dbt projects running on Spark☆31Updated last year
- A command-line interface for packaging, deploying, and running your EMR Serverless Spark jobs☆39Updated 6 months ago
- pytest plugin to run the tests with support of pyspark☆85Updated 8 months ago
- Playground for using large language models into the Modern Data Stack for entity matching☆106Updated last year
- A web API for dbt.☆110Updated 9 months ago
- Fast iterative local development and testing of Apache Airflow workflows☆193Updated this week
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆134Updated last month
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated last year
- ☆18Updated last year
- Pylint plugin for static code analysis on Airflow code☆90Updated 4 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- Ray-based Apache Beam runner☆42Updated last year
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆110Updated 4 months ago
- Black for Databricks notebooks☆44Updated 3 months ago
- All things awesome related to Dagster!☆81Updated this week
- Examples of various flow deployments for Prefect 1.0 (storage and run configurations)☆35Updated 2 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated last year
- Parse dbt artifacts and search dbt models with Algolia☆52Updated 3 years ago
- Unity Catalog UI☆39Updated 2 months ago
- A playground for running duckdb as a stateless query engine over a data lake☆172Updated 10 months ago
- The easiest way to integrate Kedro and Great Expectations☆53Updated last year
- Kedro Plugin to support running pipelines on Kubernetes using Airflow.☆29Updated last year