zinggAI / zinggLinks
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
β1,089Updated this week
Alternatives and similar repositories for zingg
Users that are interested in zingg are comparing it to the libraries listed below
Sorting:
- re_data - fix data issues before your users & CEO would discover them πβ1,572Updated last year
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backendsβ1,725Updated last week
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning mβ¦β856Updated last year
- What's in your data? Extract schema, statistics and entities from datasetsβ1,518Updated last week
- Repository for the ActivitySchema spec and supporting materialsβ428Updated 2 years ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.ioβ2,182Updated this week
- Data Pipeline Framework using the singer.io specβ652Updated last week
- MetricFlow allows you to define, build, and maintain metrics in code.β1,279Updated this week
- Template for a data contract used in a data mesh.β476Updated last year
- Open source data observability platformβ326Updated 3 years ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,113Updated 6 months ago
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hostβ¦β2,158Updated this week
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bβ¦β802Updated 3 years ago
- dbt + Metabase integrationβ544Updated last week
- Schema modelling framework for decentralised domain-driven ownership of data.β259Updated last year
- The metrics layer for your data. Join us at https://metriql.com/slackβ316Updated 2 years ago
- Scalable and efficient data transformation framework - backwards compatible with dbt.β2,631Updated this week
- Port(ish) of Great Expectations to dbt test macrosβ1,201Updated 9 months ago
- Python API for Deequβ797Updated 6 months ago
- π Awesome Data Catalogs and Observability Platforms.β913Updated last month
- Dagster Labs' open-source data platform, built with Dagster.β398Updated this week
- β382Updated last year
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wrβ¦β2,207Updated this week
- Home of the Open Data Contract Standard (ODCS).β558Updated last week
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data accesβ¦β477Updated 6 months ago
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects.β687Updated 4 months ago
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.β186Updated 2 years ago
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipeβ¦β463Updated last week
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.β375Updated 4 months ago
- PyAirbyte brings the power of Airbyte to every Python developer.β300Updated this week