adidas / lakehouse-engine
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
☆215Updated last week
Related projects: ⓘ
- Delta Lake helper methods in PySpark☆294Updated 2 weeks ago
- Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used …☆309Updated last week
- Automated migrations to Unity Catalog☆218Updated this week
- Delta Lake examples☆201Updated 3 months ago
- Metadata driven Databricks Delta Live Tables framework for bronze/silver pipelines☆142Updated this week
- Examples of Databricks Asset Bundles☆81Updated last week
- A dbt adapter for Databricks.☆211Updated this week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆185Updated this week
- A Python Library to support running data quality rules while the spark job is running⚡☆161Updated last month
- Databricks SDK for Python (Beta)☆345Updated this week
- Databricks SQL Connector for Python☆153Updated this week
- ☆328Updated 3 weeks ago
- Examples of using Terraform to deploy Databricks resources☆203Updated 2 weeks ago
- Capture deep metrics on one or all assets within a Databricks workspace☆226Updated last week
- An example showing how to apply software engineering best practices to Databricks notebooks.☆118Updated last month
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline☆148Updated last month
- 🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.☆438Updated last week
- A Swiss-Army-knife for your Data Intelligence platform administration.☆104Updated last month
- Demos to implement your Databricks Lakehouse☆269Updated 3 weeks ago
- This repository helps teach people how to correctly define and create cumulative tables!☆209Updated last month
- Databricks CLI☆129Updated this week
- Delta Lake Documentation☆45Updated 3 months ago
- Code samples, etc. for Databricks☆59Updated last month
- Home of the Open Data Contract Standard (ODCS).☆309Updated this week
- PySpark test helper methods with beautiful error messages☆583Updated last week
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆40Updated last month
- ☆44Updated 2 months ago
- Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables☆69Updated this week
- Notebooks, terraform, tools to enable setting up Unity Catalog☆44Updated last year
- Best practices for working with Databricks from an IDE☆48Updated last year