SuperNerb / Data-Governance-CompilationLinks
This is a compilation of Data Governance resources, examples, models and communities
☆16Updated 6 years ago
Alternatives and similar repositories for Data-Governance-Compilation
Users that are interested in Data-Governance-Compilation are comparing it to the libraries listed below
Sorting:
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆74Updated last year
- Full stack data engineering tools and infrastructure set-up☆56Updated 4 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆40Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.☆125Updated 4 years ago
- Content for a talk on "The wonderful world of data quality tools in Python"☆18Updated 4 years ago
- Sample projects using Ploomber.☆86Updated last year
- a set of scripts to pull meta data and data profiling metrics from relational database systems☆77Updated last year
- ☆48Updated last year
- ☆10Updated 3 years ago
- DuckDB with Dashboarding tools demo evidence, streamlit and rill☆20Updated last year
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Weekly Data Engineering Newsletter☆96Updated last year
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆28Updated 2 years ago
- Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).☆120Updated last month
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆113Updated 2 months ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆57Updated 5 months ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated last year
- Complete Repository to become an expert is SQL Window Functions☆25Updated last year
- 🐍💨 Airflow tutorial for PyCon 2019☆86Updated 2 years ago
- Demo of DuckDB Spark API implements. Same Pyspark code, but DuckDB under the hood☆14Updated last year
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆91Updated last week
- A cool simple example of functional data engineering☆34Updated 2 years ago
- Machine Learning in Snowflake☆24Updated 6 years ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆64Updated this week
- A python package to create a database on the platform using our moj data warehousing framework☆22Updated 3 months ago
- A curated list of awesome Databricks resources, including Spark☆22Updated last year
- scaffold of Apache Airflow executing Docker containers☆86Updated 2 years ago
- a collection of resources and blogs about Apache Superset☆87Updated 3 years ago