newfront / spark-moderndataengineering
The source code for the book Modern Data Engineering with Apache Spark
☆36 Updated 3 years ago
Alternatives and similar repositories for spark-moderndataengineering
Users who are interested in spark-moderndataengineering are comparing it to the repositories listed below
- Data Engineering with Spark and Delta Lake ☆102 Updated 2 years ago
- Delta Lake examples ☆227 Updated 10 months ago
- Data engineering with dbt, published by Packt ☆85 Updated last year
- Data Modeling with Snowflake, published by Packt ☆66 Updated 4 months ago
- ☆43 Updated 6 months ago
- Repository of sample Databricks notebooks ☆265 Updated last year
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian ☆219 Updated 2 years ago
- Don't Panic. This guide will help you when it feels like the end of the world. ☆26 Updated last month
- Snowflake Cookbook, published by Packt ☆80 Updated 2 years ago
- Companion repository for the book 'Delta Lake Up and Running' ☆47 Updated 4 months ago
- My Study guide used to pass the CRT020 Spark Certification exam ☆34 Updated 5 years ago
- PDF DataSource for Apache Spark; allows reading PDF files directly into a DataFrame and running OCR on them ☆71 Updated 3 months ago
- ☆90 Updated 6 months ago
- Spark style guide ☆260 Updated 10 months ago
- ☆134 Updated 5 months ago
- Code snippets for Data Engineering Design Patterns book ☆142 Updated 4 months ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆56 Updated 2 years ago
- Delta Lake helper methods in PySpark ☆325 Updated 11 months ago
- Apache Spark 3 - Structured Streaming Course Material ☆121 Updated last year
- Guide for databricks spark certification ☆58 Updated 4 years ago
- Delta Lake Documentation ☆49 Updated last year
- Magic to help Spark pipelines upgrade ☆34 Updated 10 months ago
- Weekly Data Engineering Newsletter ☆96 Updated last year
- Code for my "Efficient Data Processing in SQL" book. ☆57 Updated last year
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline ☆153 Updated 11 months ago
- Spark and Delta Lake Workshop ☆22 Updated 3 years ago
- ☆86 Updated 2 years ago
- ☆88 Updated 2 years ago
- Code samples, etc. for Databricks ☆65 Updated 2 months ago
- Data Engineering on GCP ☆36 Updated 2 years ago