benniehaelen / delta-lake-up-and-runningLinks
Companion repository for the book 'Delta Lake Up and Running'
☆47Updated 3 months ago
Alternatives and similar repositories for delta-lake-up-and-running
Users that are interested in delta-lake-up-and-running are comparing it to the libraries listed below
Sorting:
- The resources of the preparation course for Databricks Data Engineer Professional certification exam☆124Updated 3 weeks ago
- Delta Lake examples☆226Updated 9 months ago
- Data Engineering with Spark and Delta Lake☆101Updated 2 years ago
- Code snippets for Data Engineering Design Patterns book☆127Updated 3 months ago
- Data Engineering with Databricks Cookbook, published by Packt☆93Updated last year
- Data engineering with dbt, published by Packt☆81Updated last year
- Unit testing using databricks connect☆31Updated 3 years ago
- The resources of the preparation course for Databricks Data Engineer Associate certification exam☆438Updated 3 weeks ago
- ☆134Updated 5 months ago
- ☆87Updated 2 years ago
- Stream processing with Azure Databricks☆140Updated 7 months ago
- ☆113Updated 3 years ago
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.☆85Updated 11 months ago
- ☆184Updated 4 years ago
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline☆153Updated 11 months ago
- This repository provides various demos/examples of using Snowpark for Python.☆278Updated last year
- Sample project to demonstrate data engineering best practices☆194Updated last year
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆33Updated 4 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆69Updated 2 months ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆266Updated last year
- Demonstration of using Files in Repos with Databricks Delta Live Tables☆33Updated last year
- Code repository for the "PySpark in Action" book☆204Updated last month
- ☆52Updated last year
- Delta Lake helper methods in PySpark☆324Updated 10 months ago
- Source Code Collection and Supplemental Material for the O'Reilly Snowflake Definitive Guide 1st Edition book☆98Updated 2 years ago
- Data Engineering with Google Cloud Platform, published by Packt☆118Updated last year
- Code for "Efficient Data Processing in Spark" Course☆323Updated last month
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆255Updated 2 weeks ago
- Supplementary Materials for the The Complete dbt (Data Build Tool) Bootcamp Udemy course☆615Updated 2 months ago
- ADB Essentials Demos used in the webinars: https://databricks.com/p/webinar/azure-databricks-essentials-series☆61Updated 3 years ago