dennyglee / databricks
Repository of sample Databricks notebooks
☆257Updated 11 months ago
Alternatives and similar repositories for databricks:
Users that are interested in databricks are comparing it to the libraries listed below
- Databricks - Apache Spark™ - 2X Certified Developer☆266Updated 4 years ago
- Repository used for Spark Trainings☆53Updated last year
- Spark style guide☆258Updated 5 months ago
- Guide for databricks spark certification☆58Updated 3 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆33Updated 5 years ago
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline☆150Updated 7 months ago
- Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )☆85Updated 6 years ago
- Collection of Machine Learning Examples for Azure Databricks☆40Updated 4 years ago
- ETL pipeline using pyspark (Spark - Python)☆112Updated 4 years ago
- Create HTML profiling reports from Apache Spark DataFrames☆195Updated 5 years ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆213Updated last year
- Apache Spark (PySpark) Practice on Real Data☆274Updated 5 years ago
- Examples surrounding Databricks.☆57Updated 8 months ago
- Notes on Apache Spark (pyspark)☆299Updated 6 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Azure Databricks Cookbook, Published by Packt☆58Updated last year
- Make your libraries magically appear in Databricks.☆47Updated last year
- [DEPRECATED] Demo repository implementing an end-to-end MLOps workflow on Databricks. Project derived from dbx basic python template☆111Updated 2 years ago
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆44Updated last month
- Data Engineering with Spark and Delta Lake☆96Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆167Updated last year
- Monitoring Azure Databricks jobs☆222Updated 5 months ago
- Workshop for Spark and Databricks☆54Updated 5 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- A boilerplate for writing PySpark Jobs☆394Updated last year
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Updated 6 years ago
- MLFlow Spark Summit 2019 Presentation☆67Updated 5 years ago
- Delta Lake examples☆218Updated 5 months ago
- Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt☆122Updated 2 years ago
- Learn the pyspark API through pictures and simple examples☆170Updated 4 years ago