monte-carlo-data / data-downtime-challenge
☆83 Updated last year
Related projects
Alternatives and complementary repositories for data-downtime-challenge
- Source code for the MC technical blog post "Data Observability in Practice Using SQL" ☆36 Updated 4 months ago
- ☆25 Updated 2 years ago
- Airflow training for the crunch conf ☆105 Updated 6 years ago
- Data Engineering with Spark and Delta Lake ☆89 Updated last year
- Data engineering with dbt, published by Packt ☆60 Updated 8 months ago
- Snowflake Cookbook, published by Packt ☆73 Updated last year
- Developed a data pipeline to automate data warehouse ETL by building custom Airflow operators that handle the extraction, transformation,… ☆89 Updated 2 years ago
- Example repo to kickstart integration with MLflow pipelines. ☆73 Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book. ☆50 Updated 3 months ago
- ☆170 Updated 3 years ago
- Essential PySpark for Scalable Data Analytics, published by Packt ☆43 Updated last year
- Guide for Databricks Spark certification ☆58 Updated 3 years ago
- (Project & tutorial) DAG pipeline tests + CI/CD setup ☆85 Updated 3 years ago
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class. ☆62 Updated 4 years ago
- ☆86 Updated 2 years ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging… ☆73 Updated 5 years ago
- Duke MIDS: Data Engineering and DataOps Course ☆59 Updated last year
- Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and mo… ☆25 Updated 3 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆166 Updated last year
- Streaming eight subreddits from the Reddit API using a Kafka producer and Spark Structured Streaming. ☆19 Updated 3 weeks ago
- Just starting your DE journey or along the way already? I will be sharing a short list of data-engineering-centred books that cover the… ☆34 Updated 2 years ago
- A series of Jupyter notebooks that walk you through Machine Learning with the Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo… ☆76 Updated last year
- ☆29 Updated 3 years ago
- Repository for Apache Spark course at Team Data Science ☆16 Updated 4 years ago
- Data pipeline with dbt, Airflow, Great Expectations ☆158 Updated 3 years ago
- An example MLflow project ☆48 Updated 2 years ago
- Hey, this is the repo that has all the queries and data for my video game training series! ☆132 Updated 2 years ago