zinggAI / zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
β980Updated this week
Alternatives and similar repositories for zingg:
Users that are interested in zingg are comparing it to the libraries listed below
- re_data - fix data issues before your users & CEO would discover them πβ1,565Updated 9 months ago
- do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning mβ¦β854Updated 10 months ago
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.ioβ2,016Updated this week
- Port(ish) of Great Expectations to dbt test macrosβ1,141Updated 2 months ago
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hostβ¦β1,997Updated this week
- Repository for the ActivitySchema spec and supporting materialsβ410Updated 2 years ago
- Python API for Deequβ744Updated 4 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.β250Updated last year
- Template for a data contract used in a data mesh.β467Updated 11 months ago
- A curated list of awesome dbt resourcesβ1,284Updated 3 weeks ago
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamiltonβ862Updated last year
- Data Pipeline Framework using the singer.io specβ648Updated 2 weeks ago
- π³ The stupidly simple CLI workspace for your data warehouse.β726Updated 2 years ago
- π Awesome Data Catalogs and Observability Platforms.β784Updated 6 months ago
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wrβ¦β1,955Updated this week
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backendsβ1,470Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,040Updated 4 months ago
- MetricFlow allows you to define, build, and maintain metrics in code.β1,180Updated this week
- dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipeβ¦β410Updated this week
- dbt + Metabase integrationβ495Updated this week
- Utility functions for dbt projects.β1,454Updated 3 weeks ago
- PySpark test helper methods with beautiful error messagesβ663Updated last month
- A curated list of awesome posts, videos, and articles on leading a data team (small and large)β526Updated last year
- Collection of dbt Tips and Tricksβ379Updated 2 years ago
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricksβ419Updated last week
- Home of the Open Data Contract Standard (ODCS).β444Updated last week
- Work with your web service, database, and streaming schemas in a single format.β337Updated 10 months ago
- List of `pre-commit` hooks to ensure the quality of your `dbt` projects.β625Updated 2 weeks ago
- Macros for calculating metricsβ216Updated last week
- Macros that generate dbt codeβ523Updated 3 weeks ago