zinggAI / zinggLinks
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
β1,097Updated this week
Alternatives and similar repositories for zingg
Users that are interested in zingg are comparing it to the libraries listed below
Sorting:
- re_data - fix data issues before your users & CEO would discover them πβ1,571Updated last year
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning mβ¦β856Updated last year
- Repository for the ActivitySchema spec and supporting materialsβ430Updated 2 years ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.ioβ2,204Updated this week
- What's in your data? Extract schema, statistics and entities from datasetsβ1,524Updated last month
- Data Pipeline Framework using the singer.io specβ654Updated 2 weeks ago
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hostβ¦β2,172Updated last week
- MetricFlow allows you to define, build, and maintain metrics in code.β1,306Updated this week
- π³ The stupidly simple CLI workspace for your data warehouse.β728Updated 2 years ago
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backendsβ1,748Updated last week
- Schema modelling framework for decentralised domain-driven ownership of data.β259Updated last year
- Port(ish) of Great Expectations to dbt test macrosβ1,204Updated 10 months ago
- The metrics layer for your data. Join us at https://metriql.com/slackβ318Updated 2 years ago
- β384Updated last year
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,116Updated 6 months ago
- dbt + Metabase integrationβ551Updated last week
- π Awesome Data Catalogs and Observability Platforms.β925Updated 2 months ago
- Open source data observability platformβ326Updated 3 years ago
- Template for a data contract used in a data mesh.β476Updated last year
- Python API for Deequβ800Updated 6 months ago
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bβ¦β804Updated 3 years ago
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wrβ¦β2,228Updated this week
- Malloy is a modern open source language for describing data relationships and transformations.β2,277Updated this week
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipeβ¦β466Updated this week
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data accesβ¦β478Updated 7 months ago
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.β375Updated 5 months ago
- Work with your web service, database, and streaming schemas in a single format.β343Updated last month
- Macros that generate dbt codeβ605Updated 4 months ago
- This dbt package contains macros to support unit testing that can be (re)used across dbt projects.β444Updated 8 months ago
- Scalable and efficient data transformation framework - backwards compatible with dbt.β2,686Updated this week