newfront / spark-moderndataengineeringLinks
The source code for the book Modern Data Engineering with Apache Spark
☆36Updated 2 years ago
Alternatives and similar repositories for spark-moderndataengineering
Users that are interested in spark-moderndataengineering are comparing it to the libraries listed below
Sorting:
- Data engineering with dbt, published by Packt☆81Updated last year
- Delta Lake examples☆226Updated 9 months ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆216Updated 2 years ago
- Data Engineering with Spark and Delta Lake☆101Updated 2 years ago
- Snowflake Cookbook, published by Packt☆80Updated 2 years ago
- Playing with different packages of the Apache Spark☆30Updated last year
- Code snippets for Data Engineering Design Patterns book☆128Updated 3 months ago
- Apache Spark 3 - Structured Streaming Course Material☆121Updated last year
- ☆43Updated 5 months ago
- Spark style guide☆259Updated 9 months ago
- ☆91Updated 6 months ago
- Data Modeling with Snowflake, published by Packt☆65Updated 3 months ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆174Updated last year
- Code for dbt tutorial☆156Updated last month
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆124Updated 3 weeks ago
- ☆184Updated 4 years ago
- Weekly Data Engineering Newsletter☆96Updated last year
- Demonstration of using Files in Repos with Databricks Delta Live Tables☆33Updated last year
- Delta Lake helper methods in PySpark☆324Updated 10 months ago
- Don't Panic. This guide will help you when it feels like the end of the world.☆26Updated last month
- Guide for databricks spark certification☆58Updated 4 years ago
- Repository of sample Databricks notebooks☆265Updated last year
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆45Updated 5 months ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆169Updated last year
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆143Updated last year
- Companion repository for the book 'Delta Lake Up and Running'☆47Updated 3 months ago
- ☆134Updated 5 months ago
- Databricks - Apache Spark™ - 2X Certified Developer☆266Updated 4 years ago
- Data pipeline with dbt, Airflow, Great Expectations☆163Updated 4 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆69Updated 2 months ago