sqlparser / python_data_lineage
Data lineage tools in python
☆29Updated 4 months ago
Alternatives and similar repositories for python_data_lineage:
Users that are interested in python_data_lineage are comparing it to the libraries listed below
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆134Updated 2 months ago
- Yet Another (Spark) ETL Framework☆20Updated last year
- ☆69Updated last month
- dbt's adapter for dremio☆48Updated 2 years ago
- Make simple storing test results and visualisation of these in a BI dashboard☆43Updated 2 weeks ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆73Updated last year
- re_data - fix data issues before your users & CEO would discover them 😊☆98Updated 10 months ago
- dbt-core-interface is an MIT licensed high level wrapper for dbt-core that can be used to drive third party integrations such as servers,…☆31Updated last year
- Make dbt docs and Apache Superset talk to one another☆142Updated 2 months ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- Unity Catalog UI☆40Updated 6 months ago
- StarSnow: HTTP Client for Snowflake database (HTTP get/post from SQL)☆26Updated 2 years ago
- ☆34Updated 11 months ago
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆116Updated 2 months ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆73Updated 3 years ago
- Fake Pandas / PySpark DataFrame creator☆46Updated last year
- A curated list of dagster code snippets for data engineers☆54Updated last year
- Snowflake Database, Schema, and Warehouse provisioning with Access Roles & Generating and Provisioning of Functional Roles & Snowflake So…☆42Updated 4 months ago
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆86Updated 3 weeks ago
- Metabase DuckDB Driver shipped as 3rd party plugin☆79Updated 11 months ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆54Updated last week
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆1Updated last week
- ☆143Updated last week
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆89Updated last week
- Utility functions for dbt projects running on Spark☆31Updated last month
- Package hub for dbt.☆29Updated this week
- A bunch of hacks developed around dbt☆48Updated 5 years ago
- Creates simple data models on Snowflake to report dbt source freshness and tests☆26Updated last year
- A repository of sample code to show data quality checking best practices using Airflow.☆75Updated 2 years ago