zinggAI / zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
★957 Updated this week
Related projects
Alternatives and complementary repositories for zingg
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io ★1,913 Updated last week
- re_data - fix data issues before your users & CEO would discover them ★1,552 Updated 6 months ago
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning m… ★853 Updated 7 months ago
- Port(ish) of Great Expectations to dbt test macros ★1,083 Updated 2 months ago
- MetricFlow allows you to define, build, and maintain metrics in code. ★1,146 Updated this week
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-host… ★1,934 Updated this week
- dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org) ★923 Updated this week
- Awesome Data Catalogs and Observability Platforms. ★727 Updated 3 months ago
- Data Pipeline Framework using the singer.io spec ★641 Updated this week
- dbt + Metabase integration ★472 Updated 2 weeks ago
- Efficient data transformation and modeling framework that is backwards compatible with dbt. ★1,824 Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data. ★247 Updated 11 months ago
- Repository for the ActivitySchema spec and supporting materials ★401 Updated last year
- Collection of dbt Tips and Tricks ★369 Updated 2 years ago
- Utility functions for dbt projects. ★1,379 Updated last week
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects. ★599 Updated last week
- Python API for Deequ ★730 Updated last month
- Template for a data contract used in a data mesh. ★464 Updated 8 months ago
- Work with your web service, database, and streaming schemas in a single format. ★332 Updated 7 months ago
- The metrics layer for your data. Join us at https://metriql.com/slack ★298 Updated last year
- A curated list of awesome dbt resources ★1,191 Updated last month
- ★347 Updated 9 months ago
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipe… ★393 Updated this week
- This package contains macros and models to find DAG issues automatically ★451 Updated this week
- Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data acces… ★422 Updated this week
- Turning PySpark Into a Universal DataFrame API ★323 Updated this week
- A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineer… ★510 Updated 3 months ago
- Apache Airflow integration for dbt ★396 Updated 6 months ago
- This dbt package contains macros to support unit testing that can be (re)used across dbt projects. ★423 Updated 3 months ago
- Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow. ★350 Updated this week