goodwillpunning / hyperleaup
Create and manipulate Tableau Hyper files from Apache Spark DataFrames and Spark SQL
☆29Updated 5 months ago
Related projects: ⓘ
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆41Updated 2 months ago
- A Swiss-Army-knife for your Data Intelligence platform administration.☆104Updated last month
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆40Updated last month
- Examples of Databricks Asset Bundles☆81Updated last week
- ☆16Updated last month
- Metadata driven Databricks Delta Live Tables framework for bronze/silver pipelines☆143Updated this week
- Best practices for working with Databricks from an IDE☆48Updated last year
- Fake Pandas / PySpark DataFrame creator☆35Updated 6 months ago
- Set of Terraform automation templates and quickstart demos to jumpstart the design of a Lakehouse on Databricks. This project has incorpo…☆71Updated 7 months ago
- A dbt adapter for Databricks.☆211Updated this week
- Utility functions for dbt projects running on Spark☆30Updated 10 months ago
- Spark app to merge different schemas☆23Updated 3 years ago
- Rules based grant management for Snowflake☆40Updated 5 years ago
- Data engineering with dbt, published by Packt☆55Updated 6 months ago
- Yet Another (Spark) ETL Framework☆18Updated 11 months ago
- An example showing how to apply software engineering best practices to Databricks notebooks.☆118Updated last month
- A Python Library to support running data quality rules while the spark job is running⚡☆161Updated last month
- Playing with different packages of the Apache Spark☆26Updated 3 months ago
- Delta Lake examples☆201Updated 3 months ago
- A bunch of hacks developed around dbt☆48Updated 4 years ago
- Delta Lake Documentation☆45Updated 3 months ago
- Databricks Migration Tools☆43Updated 3 years ago
- Databricks SQL Connector for Python☆153Updated this week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆185Updated this week
- An experimental tool to synchronize source Databricks deployment with a target Databricks deployment.☆46Updated 8 months ago
- Extensible Rules Engine for custom Dataframe / Dataset validation☆134Updated 4 months ago
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆22Updated 5 months ago
- A PyTest plugin to speed up your tests which depend on Snowflake sessions☆27Updated 7 months ago
- Databricks CLI☆130Updated this week
- An example dbt project using AutomateDV to create a Data Vault 2.0 Data Warehouse based on the Snowflake TPC-H dataset.☆38Updated 6 months ago