Generate and Visualize Data Lineage from query history
☆326Aug 4, 2023Updated 2 years ago
Alternatives and similar repositories for data-lineage
Users that are interested in data-lineage are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data Catalog for Databases and Data Warehouses☆36Jan 15, 2024Updated 2 years ago
- SQL Lineage Analysis Tool powered by Python☆1,631Updated this week
- Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub☆338Jan 5, 2024Updated 2 years ago
- Collect, aggregate, and visualize a data ecosystem's metadata☆2,149Updated this week
- An Open Standard for lineage metadata collection☆2,362Mar 20, 2026Updated last week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Visualise your Kedro data and machine-learning pipelines and track your experiments.☆743Updated this week
- Egeria core☆899Updated this week
- A CLI to manage and monitor permissions in AWS Lake Formation☆25Feb 8, 2023Updated 3 years ago
- Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.☆148Jun 3, 2024Updated last year
- Data Contracts engine for the modern data stack. https://www.soda.io☆2,311Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,751Mar 20, 2026Updated last week
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-host…☆2,287Updated this week
- Data Lineage Tracking And Visualization Solution☆656Mar 20, 2026Updated last week
- Document, sample code and other materials for SQLFlow☆1,025Mar 12, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business…☆1,389Mar 17, 2026Updated last week
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆2,975Mar 19, 2026Updated last week
- re_data - fix data issues before your users & CEO would discover them 😊☆1,569Apr 30, 2024Updated last year
- A federated, open-source data catalog for all your big data and small data☆587Mar 12, 2026Updated 2 weeks ago
- Visualize column-level data lineage in Spark SQL☆92May 13, 2022Updated 3 years ago
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.☆268Mar 4, 2026Updated 3 weeks ago
- This dbt package captures metadata, artifacts, and test results so you can detect anomalies, monitor data quality, and build metadata tab…☆495Mar 16, 2026Updated last week
- The Metadata Platform for your Data and AI Stack☆11,720Updated this week
- The metrics layer for your data. Join us at https://metriql.com/slack☆327Mar 29, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 🐳 The stupidly simple CLI workspace for your data warehouse.☆728Feb 8, 2023Updated 3 years ago
- Compare tables within or across databases☆2,990May 17, 2024Updated last year
- PostgreSQL Languages AST and statements prettifier: master branch covers PG10, v2 branch covers PG12, v3 covers PG13, v4 covers PG14, v5 …☆392Updated this week
- Python SQL Parser and Transpiler☆9,064Updated this week
- ☆83Feb 25, 2025Updated last year
- OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata rep…☆8,976Updated this week
- A simplified, lightweight ETL Framework based on Apache Spark☆587Jan 24, 2024Updated 2 years ago
- A collection of utilities and tools for teams and organizations using dbt☆15Nov 24, 2023Updated 2 years ago
- A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts☆391Mar 12, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This package contains macros and models to find DAG issues automatically☆545Feb 27, 2026Updated last month
- Always know what to expect from your data.☆11,280Updated this week
- SQL语法词法分析 SQL表级血缘 SQL字段级别血缘 SQL函数血缘 SQL编译器☆17Nov 1, 2022Updated 3 years ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆161Dec 10, 2022Updated 3 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,596Updated this week
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆53Oct 9, 2023Updated 2 years ago
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineer…☆579Feb 5, 2026Updated last month