monte-carlo-data / data-downtime-challenge
☆84Updated 2 years ago
Alternatives and similar repositories for data-downtime-challenge:
Users who are interested in data-downtime-challenge are comparing it to the repositories listed below
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆36Updated 8 months ago
- ☆27Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 7 months ago
- Snowflake Cookbook, published by Packt☆78Updated 2 years ago
- Data engineering with dbt, published by Packt☆76Updated last year
- (project & tutorial) DAG pipeline tests + CI/CD setup☆86Updated 4 years ago
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class.☆63Updated 4 years ago
- Just starting your DE journey or along the way already? I will be sharing a short list of DATA-ENGINEERING-CENTRED books that cover the…☆34Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆167Updated last year
- Code snippets for Data Engineering Design Patterns book☆74Updated last month
- Full stack data engineering tools and infrastructure set-up☆50Updated 4 years ago
- Example repo to kickstart integration with MLflow pipelines.☆76Updated 2 years ago
- A modern ELT demo using Airbyte, dbt, Snowflake, and Dagster☆27Updated 2 years ago
- Essential PySpark for Scalable Data Analytics, published by Packt☆43Updated 2 years ago
- Weekly Data Engineering Newsletter☆94Updated 8 months ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- ☆36Updated 2 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- An example MLflow project☆48Updated 2 months ago
- Developed a data pipeline to automate data warehouse ETL by building custom Airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- Batch processing, orchestration using Apache Airflow and Google Workflows, Spark Structured Streaming, and a lot more☆19Updated 2 years ago
- ☆33Updated last year
- ☆74Updated 5 months ago
- Streaming eight subreddits from the Reddit API using a Kafka producer and Spark Structured Streaming.☆19Updated last week
- Code repository for the "PySpark in Action" book☆194Updated 2 years ago
- Data Engineering with Google Cloud Platform, published by Packt☆113Updated last year
- This repo guides you step by step through creating a star schema dimensional model.☆25Updated 3 years ago
- Data Engineering with Spark and Delta Lake☆96Updated 2 years ago
- ☆39Updated 3 years ago
- ⭕️ Data Engineering for Data Scientists☆77Updated 2 years ago