SuperNerb / Data-Governance-CompilationLinks
This is a compilation of Data Governance resources, examples, models and communities
☆14Updated 6 years ago
Alternatives and similar repositories for Data-Governance-Compilation
Users that are interested in Data-Governance-Compilation are comparing it to the libraries listed below
Sorting:
- Machine Learning in Snowflake☆24Updated 5 years ago
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 4 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆37Updated 6 years ago
- a set of scripts to pull meta data and data profiling metrics from relational database systems☆77Updated last year
- Collection of utility scripts to extract code so it can be upgraded to SnowFlake using the SnowConvert tool.☆20Updated this week
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆38Updated last year
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆75Updated last year
- ☆10Updated 3 years ago
- Utilities for creating ETL pipelines with mara☆36Updated 3 years ago
- Notebooks for the ML Link Prediction Course☆14Updated 4 years ago
- ☆11Updated 5 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 5 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆12Updated last year
- Fake Pandas / PySpark DataFrame creator☆47Updated last year
- A python package to create a database on the platform using our moj data warehousing framework☆22Updated last week
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆59Updated this week
- Business Data Analysis by HiPIC of CalStateLA☆20Updated 6 years ago
- Tough and flexible tools for data analysis, transformation, validation and movement.☆139Updated last year
- Cost Efficient Data Pipelines with DuckDB☆55Updated 2 months ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 2 years ago
- ☆16Updated 2 years ago
- ☆20Updated 8 years ago
- Weekly Data Engineering Newsletter☆96Updated last year
- A bunch of hacks developed around dbt☆48Updated 5 years ago
- ☆48Updated last year
- A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in …☆22Updated 2 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆110Updated this week