SuperNerb / Data-Governance-Compilation
This is a compilation of Data Governance resources, examples, models and communities
☆10Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Data-Governance-Compilation
- a set of scripts to pull meta data and data profiling metrics from relational database systems☆75Updated 7 months ago
- DuckDB with Dashboarding tools demo evidence, streamlit and rill☆12Updated 11 months ago
- Getting Great Expectations setup to run on DataBricks with Spark Dataframes.☆12Updated 2 years ago
- A python package to create a database on the platform using our moj data warehousing framework☆21Updated 2 months ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆27Updated 2 years ago
- A cool simple example of functional data engineering☆33Updated last year
- Collection of utility scripts to extract code so it can be upgraded to SnowFlake using the SnowConvert tool.☆11Updated 5 months ago
- A serverless duckDB deployment at GCP☆35Updated 2 years ago
- A curated list of data wrangling resources☆32Updated 6 years ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S…☆12Updated 4 months ago
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 3 years ago
- Ibis analytics, with Ibis (and more!)☆19Updated last month
- Utility functions for dbt projects running on Spark☆31Updated last year
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 5 years ago
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- ☆19Updated last year
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆85Updated this week
- Build your feature store with macros right within your dbt repository☆37Updated last year
- A DBT package to perform DataOps & administrative CI/CD on your data warehouse.☆16Updated 3 years ago
- The Taxonomy for ETL Automation Metadata (TEAM) is a tool for design metadata management geared towards data warehouse automation. It is …☆34Updated 2 months ago
- Data-aware orchestration with dagster, dbt, and airbyte☆30Updated last year
- A bunch of hacks developed around dbt☆48Updated 5 years ago
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆71Updated last year
- IPython magics to work with DBT☆14Updated 2 years ago
- DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, aud…☆25Updated 2 weeks ago
- Postgres utility package for dbt (getdbt.com)☆18Updated 3 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆60Updated last year
- Data lineage tools in python☆24Updated this week
- Fuzzy matching function in spark (https://spark-packages.org/package/itspawanbhardwaj/spark-fuzzy-matching)☆23Updated 4 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆36Updated 4 months ago