Compare tables within or across databases
☆2,988May 17, 2024Updated last year
Alternatives and similar repositories for data-diff
Users that are interested in data-diff are comparing it to the libraries listed below
Sorting:
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆2,914Feb 18, 2026Updated last week
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-host…☆2,255Updated this week
- Data Contracts engine for the modern data stack. https://www.soda.io☆2,298Updated this week
- re_data - fix data issues before your users & CEO would discover them 😊☆1,569Apr 30, 2024Updated last year
- Python SQL Parser and Transpiler☆8,965Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆12,279Updated this week
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects.☆729Feb 6, 2026Updated 3 weeks ago
- A modular SQL linter and auto-formatter with support for multiple dialects and templated code.☆9,536Updated this week
- MetricFlow allows you to define, build, and maintain metrics in code.☆1,495Updated this week
- Port(ish) of Great Expectations to dbt test macros☆1,209Dec 16, 2024Updated last year
- Provides automated YAML management and a streamlit workbench. Designed to optimize dev workflows.☆609Feb 5, 2026Updated 3 weeks ago
- Always know what to expect from your data.☆11,162Feb 20, 2026Updated last week
- This package contains macros and models to find DAG issues automatically☆532Feb 4, 2026Updated 3 weeks ago
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning m…☆855Apr 5, 2024Updated last year
- An orchestration platform for the development, production, and observation of data assets.☆15,007Updated this week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak…☆20,749Updated this week
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files☆274Jan 29, 2026Updated last month
- Malloy is a modern open source language for describing data relationships and transformations.☆2,405Updated this week
- Useful macros when performing data audits☆396Jan 20, 2026Updated last month
- A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts☆388Feb 2, 2026Updated 3 weeks ago
- dbt adapter for DuckDB☆1,234Updated this week
- Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.☆2,509Updated this week
- Utility functions for dbt projects.☆1,700Jan 13, 2026Updated last month
- data load tool (dlt) is an open source Python library that makes data loading easy 🛠️☆4,949Updated this week
- This dbt package captures metadata, artifacts, and test results so you can detect anomalies, monitor data quality, and build metadata tab…☆489Updated this week
- Business intelligence as code: build fast, interactive data visualizations in SQL and markdown☆5,954Feb 18, 2026Updated last week
- High-performance diffing of large datasets across databases☆516Aug 18, 2025Updated 6 months ago
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,740Feb 19, 2026Updated last week
- Code review for data in dbt☆494Jan 3, 2025Updated last year
- An Open Standard for lineage metadata collection☆2,324Updated this week
- 🧙 Build, run, and manage data pipelines for integrating and transforming data.☆8,653Feb 20, 2026Updated last week
- Schema modelling framework for decentralised domain-driven ownership of data.☆261Dec 5, 2023Updated 2 years ago
- Self-serve BI to 10x your data team ⚡️☆5,580Updated this week
- This dbt package contains macros to support unit testing that can be (re)used across dbt projects.☆449Feb 11, 2025Updated last year
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr…☆2,364Feb 20, 2026Updated last week
- PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement☆10,732Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆21,652Feb 21, 2026Updated last week
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineer…☆577Feb 5, 2026Updated 3 weeks ago
- The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL☆6,239Updated this week