sizrailev / life-around-data-codeLinks
Code snippets and tools published on the blog at lifearounddata.com
☆12Updated 5 years ago
Alternatives and similar repositories for life-around-data-code
Users that are interested in life-around-data-code are comparing it to the libraries listed below
Sorting:
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆169Updated last year
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- Cloned by the `dbt init` task☆61Updated last year
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆38Updated last year
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 2 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆31Updated 4 years ago
- Execution of DBT models using Apache Airflow through Docker Compose☆118Updated 2 years ago
- Spark app to merge different schemas☆23Updated 4 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆56Updated 2 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆110Updated this week
- scaffold of Apache Airflow executing Docker containers☆86Updated 2 years ago
- A _simple_ starter template for Snowflake Cloud Data Platform☆39Updated 3 years ago
- Rules based grant management for Snowflake☆40Updated 6 years ago
- This repository is part of an article "Prefect workflow automation with Azure DevOps and AKS"☆30Updated 4 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Big Data Demystified meetup and blog examples☆31Updated 11 months ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆131Updated 3 years ago
- Full stack data engineering tools and infrastructure set-up☆55Updated 4 years ago
- Fake Pandas / PySpark DataFrame creator☆47Updated last year
- A bunch of hacks developed around dbt☆48Updated 5 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆86Updated 2 years ago
- Read Delta tables without any Spark☆47Updated last year
- The Picnic Data Vault framework.☆129Updated last year
- New generation opensource data stack☆70Updated 3 years ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago