sizrailev / life-around-data-code
Code snippets and tools published on the blog at lifearounddata.com
☆12 Updated 5 years ago
Alternatives and similar repositories for life-around-data-code
Users who are interested in life-around-data-code are comparing it to the repositories listed below
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆38 Updated 10 months ago
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker ☆31 Updated 3 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation. ☆37 Updated 5 years ago
- ☆21 Updated 4 years ago
- Spark app to merge different schemas ☆23 Updated 4 years ago
- A tutorial for the Great Expectations library. ☆71 Updated 4 years ago
- Cloned by the `dbt init` task ☆61 Updated last year
- A repository of sample code to show data quality checking best practices using Airflow. ☆77 Updated 2 years ago
- learning-by-doing data model built with dbt-core ☆13 Updated 5 months ago
- This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field. ☆20 Updated 7 months ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆168 Updated last year
- Data lake, data warehouse on GCP ☆56 Updated 3 years ago
- Machine Learning in Snowflake ☆24 Updated 5 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup ☆88 Updated 4 years ago
- ☆28 Updated last year
- ☆16 Updated 2 years ago
- Data validation library for PySpark 3.0.0 ☆33 Updated 2 years ago
- A proof of concept for how to set up a codebase for an analytics org. ☆14 Updated 3 years ago
- Parse dbt artifacts and search dbt models with Algolia ☆52 Updated 4 years ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆52 Updated 6 months ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on… ☆27 Updated 2 years ago
- Example orchestration pipeline for Fivetran + dbt managed by Airflow ☆22 Updated 4 years ago
- ☆75 Updated 2 weeks ago
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in … ☆21 Updated 2 years ago
- A Python API for Asynchronously Loading Data into Snowflake DB ☆64 Updated 6 months ago
- Example repo to create end to end tests for data pipeline. ☆24 Updated 11 months ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster ☆28 Updated 2 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python. ☆109 Updated this week
- Orchestrating ELT using the modern data stack ☆10 Updated 3 years ago
- This is a simple analytic project using DuckDB & dbt with air quality data. ☆20 Updated last year