jaceklaskowski / learn-databricks
Notebooks to learn Databricks Lakehouse Platform
β24Updated last week
Alternatives and similar repositories for learn-databricks
Users that are interested in learn-databricks are comparing it to the libraries listed below
Sorting:
- π§± A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricksβ55Updated 5 months ago
- The resources of the preparation course for Databricks Data Engineer Professional certification examβ114Updated last month
- Delta Lake helper methods in PySparkβ323Updated 8 months ago
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.β70Updated 9 months ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for severβ¦β246Updated 3 months ago
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.β45Updated 3 months ago
- Delta Lake examplesβ224Updated 7 months ago
- Custom PySpark Data Sourcesβ50Updated 2 weeks ago
- Demonstration of using Files in Repos with Databricks Delta Live Tablesβ32Updated 10 months ago
- A Python Library to support running data quality rules while the spark job is runningβ‘β188Updated last week
- DBSQL SME Repo contains demos, tutorials, blog code, advanced production helper functions and more!β61Updated last month
- Stream processing with Azure Databricksβ138Updated 5 months ago
- Code snippets for Data Engineering Design Patterns bookβ106Updated last month
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflowβ215Updated last week
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr itβ64Updated 3 weeks ago
- Spark style guideβ258Updated 7 months ago
- Code samples, etc. for Databricksβ64Updated last month
- Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )β86Updated 6 years ago
- Sample project to demonstrate data engineering best practicesβ191Updated last year
- Simple repo to demonstrate how to submit a spark job to EMR from Airflowβ33Updated 4 years ago
- β51Updated last year
- Examples surrounding Databricks.β58Updated 10 months ago
- β130Updated 3 months ago
- SQL Queries & Alerts for Databricks System Tables access.audit Logsβ28Updated 7 months ago
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipelineβ152Updated 9 months ago
- Local Environment to Practice Data Engineeringβ142Updated 4 months ago
- This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.β144Updated 9 months ago
- Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consuβ¦β67Updated last year
- End to end data engineering projectβ54Updated 2 years ago
- This project is for demonstrating knowledge of Data Engineering tools and concepts and also learning in the processβ46Updated 2 years ago