Data Contracts engine for the modern data stack. https://www.soda.io
β2,379Jun 29, 2026Updated this week
Alternatives and similar repositories for soda-core
Users that are interested in soda-core are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Always know what to expect from your data.β11,603Jun 26, 2026Updated last week
- re_data - fix data issues before your users & CEO would discover them πβ1,567Apr 30, 2024Updated 2 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframesβ64Mar 23, 2026Updated 3 months ago
- Scalable and efficient data transformation framework - backwards compatible with dbt.β3,159Jun 26, 2026Updated last week
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hostβ¦β2,375Updated this week
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Compare tables within or across databasesβ2,991May 17, 2024Updated 2 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.β3,625Jun 25, 2026Updated last week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applicationβ¦β13,085Updated this week
- An Open Standard for lineage metadata collectionβ2,517Jun 25, 2026Updated last week
- Port(ish) of Great Expectations to dbt test macrosβ1,230Dec 16, 2024Updated last year
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interactingβ¦β4,776Jun 1, 2026Updated last month
- data load tool (dlt) is an open source Python library that makes data loading easy π οΈβ5,532Jun 25, 2026Updated last week
- Collect, aggregate, and visualize a data ecosystem's metadataβ2,228Updated this week
- An orchestration platform for the development, production, and observation of data assets.β15,763Jun 25, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MetricFlow allows you to define, build, and maintain metrics in code.β1,664Updated this week
- A modular SQL linter and auto-formatter with support for multiple dialects and templated code.β9,790Updated this week
- Open-source data movement for ELT pipelines and AI agents β from APIs, databases & files to warehouses, lakes, and AI applications. Both β¦β21,540Jun 27, 2026Updated last week
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects.β748Jun 18, 2026Updated 2 weeks ago
- Python API for Deequβ822Jun 11, 2026Updated 3 weeks ago
- The Open Context Layer for Data and AI , OpenMetadata is the open platform for building trusted data context and business semantics for β¦β14,334Updated this week
- Python SQL Parser and Transpilerβ9,373Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wrβ¦β2,541Jun 26, 2026Updated last week
- The Context Platform for your Data and AI Stackβ12,189Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Enforce Data Contractsβ942Updated this week
- π§ Build, run, and manage data pipelines for integrating and transforming data.β8,759Jun 24, 2026Updated last week
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning mβ¦β853Apr 5, 2024Updated 2 years ago
- This dbt package captures metadata, artifacts, and test results so you can detect anomalies, monitor data quality, and build metadata tabβ¦β514Jun 25, 2026Updated last week
- An open-source data logging library for machine learning models and data pipelines. π Provides visibility into data quality & model perfβ¦β2,825Jan 10, 2025Updated last year
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,166May 19, 2026Updated last month
- Nessie: Transactional Catalog for Data Lakes with Git-like semanticsβ1,471Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β22,701Jun 25, 2026Updated last week
- A light-weight, flexible, and expressive statistical data testing libraryβ4,393Updated this week
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- dbt adapter for DuckDBβ1,305Jun 22, 2026Updated last week
- Utility functions for dbt projects.β1,768Jun 27, 2026Updated last week
- the portable Python dataframe libraryβ6,585Updated this week
- Code review for data in dbtβ494Jan 3, 2025Updated last year
- Construct Apache Airflow DAGs Declaratively via YAML configuration filesβ1,440Jun 26, 2026Updated last week
- Agentic BI. Analytics at the speed of code β‘οΈβ5,923Jun 26, 2026Updated last week
- Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of codeβ1,223Updated this week