bartosz25 / data-ai-summit-2024
Visits sessionization pipeline used for the talk
☆12Updated 3 months ago
Related projects: ⓘ
- A Python Library to support running data quality rules while the spark job is running⚡☆161Updated last month
- Delta lake and filesystem helper methods☆48Updated 6 months ago
- Extensible Rules Engine for custom Dataframe / Dataset validation☆134Updated 4 months ago
- Unity Catalog UI☆40Updated 2 weeks ago
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆41Updated 2 months ago
- A Swiss-Army-knife for your Data Intelligence platform administration.☆104Updated last month
- DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics f…☆38Updated 9 months ago
- Cross-compiler and Data Reconciler into Databricks Lakehouse☆29Updated this week
- Yet Another (Spark) ETL Framework☆18Updated 11 months ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Updated 3 months ago
- Don't Panic. This guide will help you when it feels like the end of the world.☆19Updated 3 months ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆53Updated last year
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆40Updated last month
- Delta Lake examples☆201Updated 3 months ago
- Metadata driven Databricks Delta Live Tables framework for bronze/silver pipelines☆143Updated this week
- Data validation library for PySpark 3.0.0☆34Updated last year
- The Internals of Spark on Kubernetes☆71Updated 2 years ago
- Delta Lake Documentation☆45Updated 3 months ago
- Demo project for dbt on Databricks☆27Updated 3 years ago
- Databricks Migration Tools☆43Updated 3 years ago
- Examples of Databricks Asset Bundles☆81Updated last week
- Accelerator to rapidly deploy customized features for your business☆55Updated 9 months ago
- Databricks SDK for Go☆48Updated this week
- Capture deep metrics on one or all assets within a Databricks workspace☆226Updated last week
- ✨ A Pydantic to PySpark schema library☆53Updated this week
- A library that provides useful extensions to Apache Spark and PySpark.☆193Updated last week
- Set of Terraform automation templates and quickstart demos to jumpstart the design of a Lakehouse on Databricks. This project has incorpo…☆71Updated 7 months ago
- Sample base images for Databricks Container Services☆164Updated last week
- A lightweight helper utility which allows developers to do interactive pipeline development by having a unified source code for both DLT …☆46Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆185Updated this week