jamesshocking / Spark-REST-API-UDF
Example of how to leverage Apache Spark distributed capabilities to call REST-API using a UDF
☆47Updated last year
Related projects: ⓘ
- Delta Lake examples☆201Updated 3 months ago
- A dbt adapter for Databricks.☆211Updated this week
- Delta Lake Documentation☆45Updated 3 months ago
- Examples of Databricks Asset Bundles☆81Updated last week
- A Python Library to support running data quality rules while the spark job is running⚡☆161Updated last month
- Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used …☆309Updated last week
- Delta Lake helper methods in PySpark☆300Updated 2 weeks ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆185Updated this week
- ☆328Updated 3 weeks ago
- Metadata driven Databricks Delta Live Tables framework for bronze/silver pipelines☆143Updated this week
- Simple stream processing pipeline☆89Updated 3 months ago
- An example showing how to apply software engineering best practices to Databricks notebooks.☆118Updated last month
- Spark style guide☆255Updated last year
- Data pipeline with dbt, Airflow, Great Expectations☆155Updated 3 years ago
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline☆149Updated last month
- Execution of DBT models using Apache Airflow through Docker Compose☆111Updated last year
- how to unit test your PySpark code☆27Updated 3 years ago
- A Swiss-Army-knife for your Data Intelligence platform administration.☆104Updated last month
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆41Updated 2 months ago
- Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables☆69Updated this week
- Code for dbt tutorial☆138Updated 3 months ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆31Updated 3 years ago
- Examples of using Terraform to deploy Databricks resources☆204Updated 2 weeks ago
- ☆44Updated 2 months ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆167Updated last year
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆167Updated 10 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆100Updated 2 months ago
- ☆25Updated last year
- ☆102Updated last month
- This repository helps teach people how to correctly define and create cumulative tables!☆209Updated last month