zinggAI / zinggLinks
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
β1,133Updated this week
Alternatives and similar repositories for zingg
Users that are interested in zingg are comparing it to the libraries listed below
Sorting:
- re_data - fix data issues before your users & CEO would discover them πβ1,569Updated last year
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning mβ¦β857Updated last year
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.ioβ2,271Updated this week
- What's in your data? Extract schema, statistics and entities from datasetsβ1,538Updated 3 months ago
- Repository for the ActivitySchema spec and supporting materialsβ433Updated 3 years ago
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backendsβ1,851Updated last week
- Data Pipeline Framework using the singer.io specβ657Updated 2 weeks ago
- MetricFlow allows you to define, build, and maintain metrics in code.β1,436Updated last week
- Python API for Deequβ809Updated 9 months ago
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hostβ¦β2,225Updated last week
- Schema modelling framework for decentralised domain-driven ownership of data.β260Updated 2 years ago
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bβ¦β804Updated 3 years ago
- Port(ish) of Great Expectations to dbt test macrosβ1,206Updated last year
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,134Updated 2 weeks ago
- Template for a data contract used in a data mesh.β486Updated last year
- The metrics layer for your data. Join us at https://metriql.com/slackβ324Updated 2 years ago
- dbt + Metabase integrationβ567Updated last week
- Open source data observability platformβ328Updated 3 years ago
- π³ The stupidly simple CLI workspace for your data warehouse.β728Updated 2 years ago
- β383Updated last year
- π Awesome Data Catalogs and Observability Platforms.β969Updated 5 months ago
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)β1,211Updated last week
- Scalable and efficient data transformation framework - backwards compatible with dbt.β2,849Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wrβ¦β2,311Updated this week
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.β377Updated 7 months ago
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data accesβ¦β478Updated 10 months ago
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineerβ¦β572Updated last month
- This dbt package captures metadata, artifacts, and test results so you can detect anomalies, monitor data quality, and build metadata tabβ¦β473Updated last week
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.β186Updated 2 years ago
- Home of the Open Data Contract Standard (ODCS).β636Updated this week