teamdatatonic / dopLinks
Data Orchestration Platform
☆64Updated 4 years ago
Alternatives and similar repositories for dop
Users that are interested in dop are comparing it to the libraries listed below
Sorting:
- ☆82Updated 4 months ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated 2 years ago
- ☆47Updated last year
- Cloud-native, data onboarding architecture for Google Cloud Datasets☆170Updated 2 weeks ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Great Expectations Airflow operator☆170Updated last week
- Write python locally, execute SQL in your data warehouse☆268Updated 3 years ago
- Utility to compare data between homogeneous or heterogeneous environments to ensure source and target tables match☆493Updated this week
- Make simple storing test results and visualisation of these in a BI dashboard☆52Updated last month
- Data Quality Engine for BigQuery☆278Updated 8 months ago
- Experimental MLflow plugin for Google Cloud Vertex AI☆38Updated 8 months ago
- Airflow Unit Tests and Integration Tests☆261Updated 3 years ago
- Data lake, data warehouse on GCP☆58Updated 4 years ago
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.☆268Updated 10 months ago
- Sample configuration to deploy a modern data platform.☆89Updated 4 years ago
- Making DAG construction easier☆283Updated last month
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆120Updated 4 months ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-datatransfer☆84Updated 2 years ago
- Package for dbt that allows users to train, audit and use BigQuery ML models.☆77Updated 2 months ago
- ☆87Updated 3 years ago
- Astronomer Core Docker Images☆105Updated last year
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-bigquery-storage☆120Updated 4 months ago
- ☆130Updated last year
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆45Updated this week
- ☆47Updated last year
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service☆76Updated last year
- ☆201Updated 2 years ago
- Apache Airflow integration for dbt☆411Updated last year
- (project & tutorial) dag pipeline tests + ci/cd setup☆90Updated 4 years ago
- re_data - fix data issues before your users & CEO would discover them 😊☆101Updated last year