Data Contracts engine for the modern data stack. https://www.soda.io
☆2,326Apr 7, 2026Updated this week
Alternatives and similar repositories for soda-core
Users that are interested in soda-core are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Always know what to expect from your data.☆11,391Updated this week
- re_data - fix data issues before your users & CEO would discover them 😊☆1,570Apr 30, 2024Updated last year
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Mar 23, 2026Updated 3 weeks ago
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆3,029Updated this week
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-host…☆2,299Updated this week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Compare tables within or across databases☆2,989May 17, 2024Updated last year
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,605Apr 1, 2026Updated last week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆12,605Updated this week
- An Open Standard for lineage metadata collection☆2,396Updated this week
- Port(ish) of Great Expectations to dbt test macros☆1,218Dec 16, 2024Updated last year
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,756Apr 2, 2026Updated last week
- data load tool (dlt) is an open source Python library that makes data loading easy 🛠️☆5,208Updated this week
- Collect, aggregate, and visualize a data ecosystem's metadata☆2,156Apr 7, 2026Updated last week
- An orchestration platform for the development, production, and observation of data assets.☆15,312Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MetricFlow allows you to define, build, and maintain metrics in code.☆1,543Apr 6, 2026Updated last week
- A modular SQL linter and auto-formatter with support for multiple dialects and templated code.☆9,637Updated this week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak…☆21,058Updated this week
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects.☆737Feb 6, 2026Updated 2 months ago
- OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata rep…☆10,207Updated this week
- Python API for Deequ☆815Mar 9, 2026Updated last month
- Python SQL Parser and Transpiler☆9,108Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr…☆2,468Updated this week
- The Metadata Platform for your Data and AI Stack☆11,775Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Enforce Data Contracts☆859Updated this week
- 🧙 Build, run, and manage data pipelines for integrating and transforming data.☆8,695Apr 2, 2026Updated last week
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning m…☆856Apr 5, 2024Updated 2 years ago
- An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model perf…☆2,811Jan 10, 2025Updated last year
- This dbt package captures metadata, artifacts, and test results so you can detect anomalies, monitor data quality, and build metadata tab…☆500Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,149Apr 1, 2026Updated last week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆22,126Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,450Updated this week
- A light-weight, flexible, and expressive statistical data testing library☆4,291Updated this week
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- dbt adapter for DuckDB☆1,265Updated this week
- Utility functions for dbt projects.☆1,730Jan 13, 2026Updated 3 months ago
- the portable Python dataframe library☆6,493Updated this week
- Code review for data in dbt☆495Jan 3, 2025Updated last year
- Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code☆1,177Updated this week
- Construct Apache Airflow DAGs Declaratively via YAML configuration files☆1,427Updated this week
- Self-serve BI to 10x your data team ⚡️☆5,686Updated this week