zinggAI / zinggLinks
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
β1,123Updated last week
Alternatives and similar repositories for zingg
Users that are interested in zingg are comparing it to the libraries listed below
Sorting:
- re_data - fix data issues before your users & CEO would discover them πβ1,571Updated last year
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning mβ¦β858Updated last year
- Repository for the ActivitySchema spec and supporting materialsβ431Updated 2 years ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.ioβ2,242Updated last week
- MetricFlow allows you to define, build, and maintain metrics in code.β1,385Updated 2 weeks ago
- Data Pipeline Framework using the singer.io specβ655Updated this week
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backendsβ1,800Updated this week
- What's in your data? Extract schema, statistics and entities from datasetsβ1,529Updated 2 months ago
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hostβ¦β2,201Updated last week
- Schema modelling framework for decentralised domain-driven ownership of data.β259Updated 2 years ago
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bβ¦β803Updated 3 years ago
- Port(ish) of Great Expectations to dbt test macrosβ1,204Updated 11 months ago
- Template for a data contract used in a data mesh.β484Updated last year
- π Awesome Data Catalogs and Observability Platforms.β947Updated 3 months ago
- Open source data observability platformβ327Updated 3 years ago
- dbt + Metabase integrationβ557Updated last week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)β1,193Updated this week
- Scalable and efficient data transformation framework - backwards compatible with dbt.β2,723Updated last week
- The metrics layer for your data. Join us at https://metriql.com/slackβ319Updated 2 years ago
- β384Updated last year
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data accesβ¦β479Updated 8 months ago
- π³ The stupidly simple CLI workspace for your data warehouse.β728Updated 2 years ago
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.β186Updated 2 years ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,126Updated this week
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineerβ¦β567Updated this week
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamiltonβ860Updated 2 years ago
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wrβ¦β2,280Updated this week
- This dbt package captures metadata, artifacts, and test results so you can detect anomalies, monitor data quality, and build metadata tabβ¦β470Updated this week
- Macros that generate dbt codeβ623Updated 3 weeks ago
- Home of the Open Data Contract Standard (ODCS).β594Updated this week