monte-carlo-data / data-downtime-challenge
☆93 · Updated 2 years ago
Alternatives and similar repositories for data-downtime-challenge
Users who are interested in data-downtime-challenge are comparing it to the repositories listed below.
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆40 · Updated last year
- (project & tutorial) dag pipeline tests + ci/cd setup ☆89 · Updated 4 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆168 · Updated 2 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,… ☆89 · Updated 4 years ago
- Airflow training for the crunch conf ☆104 · Updated 7 years ago
- Data engineering with dbt, published by Packt ☆87 · Updated 3 months ago
- Repository of sample Databricks notebooks ☆272 · Updated last year
- ⭕️ Data Engineering for Data Scientists ☆78 · Updated 2 years ago
- Code repository for the "PySpark in Action" book ☆211 · Updated 5 months ago
- Data Engineering with Spark and Delta Lake ☆105 · Updated 2 years ago
- ☆36 · Updated 3 years ago
- ☆28 · Updated 3 years ago
- ☆190 · Updated 4 years ago
- Data pipeline with dbt, Airflow, Great Expectations ☆165 · Updated 4 years ago
- Code for my "Efficient Data Processing in SQL" book. ☆60 · Updated last year
- Snowflake Cookbook, published by Packt ☆82 · Updated 2 years ago
- ☆143 · Updated 2 years ago
- Hey this is the repo that has all the queries and data for my video game training series! ☆153 · Updated 3 years ago
- Code snippets for Data Engineering Design Patterns book ☆278 · Updated 8 months ago
- Capturing model drift and handling its response - Example webinar ☆108 · Updated 6 years ago
- ☆88 · Updated 3 years ago
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class. ☆63 · Updated 5 years ago
- Guide for databricks spark certification ☆59 · Updated 4 years ago
- how to unit test your PySpark code ☆29 · Updated 4 years ago
- Duke MIDS: Data Engineering and DataOps Course ☆67 · Updated 10 months ago
- A book describing how to set up and maintain Data Engineering infrastructure using Google Cloud Platform. ☆126 · Updated 4 years ago
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo… ☆87 · Updated 2 years ago
- 🧱 A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricks ☆56 · Updated 5 months ago
- Example repo to kickstart integration with mlflow pipelines. ☆77 · Updated 3 years ago
- ☆120 · Updated 4 months ago