zinggAI / zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
☆1,146 Updated this week
Alternatives and similar repositories for zingg
Users that are interested in zingg are comparing it to the libraries listed below
- re_data - fix data issues before your users & CEO would discover them ☆1,569 Updated last year
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning m… ☆856 Updated last year
- Repository for the ActivitySchema spec and supporting materials ☆437 Updated 3 years ago
- Data Contracts engine for the modern data stack. https://www.soda.io ☆2,281 Updated this week
- MetricFlow allows you to define, build, and maintain metrics in code. ☆1,468 Updated this week
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-host… ☆2,242 Updated this week
- Data Pipeline Framework using the singer.io spec ☆657 Updated last week
- What's in your data? Extract schema, statistics and entities from datasets ☆1,541 Updated 4 months ago
- Template for a data contract used in a data mesh. ☆486 Updated last year
- The metrics layer for your data. Join us at https://metriql.com/slack ☆325 Updated 2 years ago
- Port(ish) of Great Expectations to dbt test macros ☆1,204 Updated last year
- Schema modelling framework for decentralised domain-driven ownership of data. ☆261 Updated 2 years ago
- dbt + Metabase integration ☆569 Updated 2 weeks ago
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends ☆1,931 Updated last week
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data acces… ☆477 Updated 10 months ago
- Python API for Deequ ☆809 Updated 2 weeks ago
- Awesome Data Catalogs and Observability Platforms. ☆987 Updated 5 months ago
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects. ☆718 Updated last week
- The stupidly simple CLI workspace for your data warehouse. ☆728 Updated 2 years ago
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros. ☆186 Updated 2 years ago
- Macros that generate dbt code ☆633 Updated last month
- This dbt package captures metadata, artifacts, and test results so you can detect anomalies, monitor data quality, and build metadata tab… ☆479 Updated this week
- Open source data observability platform ☆329 Updated 3 years ago
- This dbt package contains macros to support unit testing that can be (re)used across dbt projects. ☆448 Updated 11 months ago
- CLI tool for dbt users to simplify creation of staging models (yml and sql) files ☆272 Updated last week
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineer… ☆574 Updated 2 months ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,135 Updated last week
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to b… ☆805 Updated 3 years ago
- This package contains macros and models to find DAG issues automatically ☆525 Updated last week
- Useful macros when performing data audits ☆392 Updated 2 weeks ago