SuperNerb / Data-Governance-CompilationLinks
This is a compilation of Data Governance resources, examples, models and communities
☆18Updated 6 years ago
Alternatives and similar repositories for Data-Governance-Compilation
Users that are interested in Data-Governance-Compilation are comparing it to the libraries listed below
Sorting:
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 4 years ago
- a set of scripts to pull meta data and data profiling metrics from relational database systems☆77Updated last year
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆40Updated last year
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆75Updated 2 years ago
- ☆10Updated 3 years ago
- Complete Repository to become an expert is SQL Window Functions☆25Updated last year
- Tough and flexible tools for data analysis, transformation, validation and movement.☆140Updated 2 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆115Updated this week
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆127Updated 4 years ago
- Weekly Data Engineering Newsletter☆96Updated last year
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- Machine Learning in Snowflake☆23Updated 6 years ago
- A _simple_ starter template for Snowflake Cloud Data Platform☆39Updated 3 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- Fake Pandas / PySpark DataFrame creator☆48Updated last year
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆25Updated last year
- a collection of resources and blogs about Apache Superset☆88Updated 4 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆97Updated last week
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆69Updated 3 weeks ago
- Azure Databricks - Advent of 2020 Blogposts☆63Updated 3 years ago
- Data validation library for PySpark 3.0.0☆33Updated 3 years ago
- Sample configuration to deploy a modern data platform.☆89Updated 4 years ago
- ☆11Updated last year
- A tool to generate PySpark schema from JSON.☆28Updated 2 years ago
- ☆20Updated 8 years ago
- ☆23Updated 6 years ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 3 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆39Updated 3 years ago
- A cool simple example of functional data engineering☆34Updated 2 years ago