sizrailev / life-around-data-codeLinks
Code snippets and tools published on the blog at lifearounddata.com
☆12Updated 5 years ago
Alternatives and similar repositories for life-around-data-code
Users that are interested in life-around-data-code are comparing it to the libraries listed below
Sorting:
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆32Updated 4 years ago
- Cloned by the `dbt init` task☆62Updated last year
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆169Updated last year
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆40Updated last year
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- scaffold of Apache Airflow executing Docker containers☆86Updated 2 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Execution of DBT models using Apache Airflow through Docker Compose☆121Updated 2 years ago
- Rules based grant management for Snowflake☆41Updated 6 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆113Updated 2 months ago
- New generation opensource data stack☆73Updated 3 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- Spark app to merge different schemas☆23Updated 4 years ago
- This repository contains an example of how to leverage Cloud Composer and Cloud Dataflow to move data from a Microsoft SQL Server to BigQ…☆19Updated 4 months ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆131Updated 3 years ago
- ☆34Updated 5 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- Full stack data engineering tools and infrastructure set-up☆56Updated 4 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆88Updated 2 years ago
- ☆10Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book.☆59Updated last year
- A _simple_ starter template for Snowflake Cloud Data Platform☆39Updated 3 years ago
- Data-aware orchestration with dagster, dbt, and airbyte☆30Updated 2 years ago
- ☆80Updated 11 months ago
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- ☆156Updated 2 months ago