zinggAI / zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
☆1,072 · Updated this week
Alternatives and similar repositories for zingg
Users who are interested in zingg are comparing it to the libraries listed below.
- re_data - fix data issues before your users & CEO would discover them ☆1,566 · Updated last year
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io ☆2,145 · Updated this week
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning m… ☆854 · Updated last year
- Repository for the ActivitySchema spec and supporting materials ☆421 · Updated 2 years ago
- What's in your data? Extract schema, statistics and entities from datasets ☆1,503 · Updated this week
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-host… ☆2,115 · Updated this week
- MetricFlow allows you to define, build, and maintain metrics in code. ☆1,242 · Updated this week
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends ☆1,666 · Updated last week
- Template for a data contract used in a data mesh. ☆470 · Updated last year
- Data Pipeline Framework using the singer.io spec ☆649 · Updated last week
- Open source data observability platform ☆326 · Updated 2 years ago
- Port(ish) of Great Expectations to dbt test macros ☆1,186 · Updated 7 months ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,101 · Updated 4 months ago
- Schema modelling framework for decentralised domain-driven ownership of data. ☆254 · Updated last year
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to b… ☆796 · Updated 2 years ago
- dbt + Metabase integration ☆540 · Updated 3 weeks ago
- The metrics layer for your data. Join us at https://metriql.com/slack ☆311 · Updated 2 years ago
- 🐳 The stupidly simple CLI workspace for your data warehouse. ☆727 · Updated 2 years ago
- Python API for Deequ ☆789 · Updated 4 months ago
- ☆376 · Updated last year
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data acces… ☆474 · Updated 4 months ago
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects. ☆682 · Updated 2 months ago
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr… ☆2,151 · Updated this week
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineer… ☆550 · Updated last month
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe… ☆451 · Updated this week
- This dbt package contains a set of pre-built, pre-integrated Load and Transform dbt models for common SaaS applications. ☆260 · Updated 2 years ago
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros. ☆185 · Updated 2 years ago
- Macros that generate dbt code ☆583 · Updated last month
- Home of the Open Data Contract Standard (ODCS). ☆519 · Updated last week
- Scalable and efficient data transformation framework - backwards compatible with dbt. ☆2,499 · Updated this week