zinggAI / zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
β1,012Updated this week
Alternatives and similar repositories for zingg:
Users that are interested in zingg are comparing it to the libraries listed below
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.ioβ2,063Updated this week
- Repository for the ActivitySchema spec and supporting materialsβ415Updated 2 years ago
- re_data - fix data issues before your users & CEO would discover them πβ1,563Updated 11 months ago
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data accesβ¦β469Updated last month
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning mβ¦β850Updated last year
- The metrics layer for your data. Join us at https://metriql.com/slackβ306Updated 2 years ago
- Python API for Deequβ761Updated 2 weeks ago
- Schema modelling framework for decentralised domain-driven ownership of data.β252Updated last year
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backendsβ1,548Updated last week
- Write python locally, execute SQL in your data warehouseβ269Updated 2 years ago
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hostβ¦β2,048Updated this week
- MetricFlow allows you to define, build, and maintain metrics in code.β1,200Updated last week
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineerβ¦β535Updated last month
- Scalable and efficient data transformation framework - backwards compatible with dbt.β2,242Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,067Updated 2 weeks ago
- dbt + Metabase integrationβ510Updated last week
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipeβ¦β429Updated this week
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamiltonβ861Updated last year
- Port(ish) of Great Expectations to dbt test macrosβ1,158Updated 4 months ago
- Macros that generate dbt codeβ550Updated 2 weeks ago
- Data Pipeline Framework using the singer.io specβ647Updated last week
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.β369Updated this week
- Open source data observability platformβ323Updated 2 years ago
- Template for a data contract used in a data mesh.β470Updated last year
- Macros for calculating metricsβ218Updated 2 months ago
- Malloy is an experimental language for describing data relationships and transformations.β2,120Updated last week
- This package contains macros and models to find DAG issues automaticallyβ474Updated 3 weeks ago
- Apache Airflow integration for dbtβ402Updated 11 months ago
- CLI tool for dbt users to simplify creation of staging models (yml and sql) filesβ261Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wrβ¦β2,025Updated this week