michaelchanwahyan / datalab
☆110Updated 4 months ago
Alternatives and similar repositories for datalab:
Users that are interested in datalab are comparing it to the libraries listed below
- Airflow training for the crunch conf☆105Updated 6 years ago
- This repository is part of an article "Prefect workflow automation with Azure DevOps and AKS"☆30Updated 4 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Updated last year
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- A web frontend for scheduling Jupyter notebook reports☆252Updated 5 months ago
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago
- Execution of DBT models using Apache Airflow through Docker Compose☆116Updated 2 years ago
- Docker image for dbt (data build tool).☆49Updated 3 years ago
- dagster scikit-learn pipeline example.☆44Updated 2 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆87Updated 4 years ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆126Updated 2 years ago
- Load data from redshift into a pandas DataFrame and vice versa.☆139Updated last year
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆75Updated 3 years ago
- python automatic data quality check toolkit☆283Updated 4 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆108Updated last week
- ☆199Updated last year
- ☆74Updated this week
- Creates simple data models on Snowflake to report dbt source freshness and tests☆26Updated last year
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 4 years ago
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- ☆47Updated 3 years ago
- Code snippets and tools published on the blog at lifearounddata.com☆12Updated 5 years ago
- ETL with Python - Taught at DWH course 2017 (TAU)☆103Updated 7 years ago
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆124Updated 3 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆38Updated 9 months ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Rules based grant management for Snowflake☆40Updated 6 years ago
- Data pipeline with dbt, Airflow, Great Expectations☆162Updated 3 years ago
- Create HTML profiling reports from Apache Spark DataFrames☆196Updated 5 years ago