reisdebora / awesome-databricks
A curated list of awesome Databricks resources, including Spark
☆14Updated 2 months ago
Related projects: ⓘ
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆15Updated 7 months ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 4 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Spark package for checking data quality☆25Updated last year
- Optimizing Databricks Workload, published by Packt☆15Updated last year
- Spark and Delta Lake Workshop☆21Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆38Updated 3 years ago
- Spark data pipeline that processes movie ratings data.☆26Updated last month
- Spark app to merge different schemas☆23Updated 3 years ago
- ☆10Updated this week
- Collection of Databricks and Jupyter Notebooks☆22Updated 6 months ago
- Examples for High Performance Spark☆15Updated 3 weeks ago
- Awesome content all about Azure Databricks☆15Updated 2 years ago
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Updated last year
- Delta Lake Documentation☆45Updated 3 months ago
- Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution☆65Updated 4 years ago
- Yet Another (Spark) ETL Framework☆18Updated 10 months ago
- ☆12Updated 11 months ago
- ☆32Updated 3 months ago
- DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics f…☆38Updated 9 months ago
- Set of Terraform automation templates and quickstart demos to jumpstart the design of a Lakehouse on Databricks. This project has incorpo…☆71Updated 7 months ago
- Examples surrounding Databricks.☆55Updated 2 months ago
- Metadata Driven Development (m3d) is a cloud and platform agnostic framework for the automated creation, management and governance of dat…☆31Updated last year
- Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and mo…☆25Updated 3 years ago
- Code that was used as an example during the Data+AI Summit 2020☆15Updated 3 years ago
- AWS Big Data Certification☆24Updated last year
- Code snippets for Data Engineering Design Patterns book☆27Updated this week
- Data Engineering with Spark and Delta Lake☆86Updated last year
- ☆35Updated last month
- ☆26Updated 4 years ago