moj-analytical-services / splink_graph
pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other domains)
☆10 Updated last year
Alternatives and similar repositories for splink_graph:
Users who are interested in splink_graph are comparing it to the libraries listed below.
- ☆19 Updated last year
- A dbt package designed to help SQL based analysis of graphs ☆20 Updated last year
- Build your feature store with macros right within your dbt repository ☆38 Updated 2 years ago
- Ibis analytics, with Ibis (and more!) ☆20 Updated 4 months ago
- A serverless duckDB deployment at GCP ☆38 Updated 2 years ago
- Linear regression in SQL using dbt ☆68 Updated last month
- A python package to create a database on the platform using our moj data warehousing framework ☆21 Updated 5 months ago
- dbt-generator - Generate and transform base models for dbt project ☆46 Updated 2 years ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 Updated 3 years ago
- dbt package mimicking dplyr select-helpers semantics ☆139 Updated 5 months ago
- Demo repository to lambda-fy your dbt runs ☆11 Updated last year
- Interactive notebooks containing demonstration code of the splink library ☆37 Updated last year
- A dbt-Core package for generating models from an activity stream. ☆39 Updated 10 months ago
- ☆19 Updated 6 months ago
- dbt-core-interface is an MIT licensed high level wrapper for dbt-core that can be used to drive third party integrations such as servers,… ☆31 Updated last year
- The SQL/Ibis powered sklearn of record linkage ☆13 Updated last week
- Efficient String Comparison Functions and Fuzzy String Matching ☆17 Updated 2 years ago
- This repository contains CROW, the Clerical Resolution Online Widget, an open-source project designed to help data linkers with their cle… ☆10 Updated 3 months ago
- Perform Bayesian record linkage with a one-to-one matching assumption. ☆11 Updated 4 years ago
- Materialize plugin for dbt ☆12 Updated 4 years ago
- Department of Education (DOE) for New South Wales (AUS) data stack in a box ☆30 Updated 3 months ago
- 📦 Example repository showing how to use dbt inside Visual Studio Code development containers ☆40 Updated 2 months ago
- Fully unit tested utility functions for data engineering. Python 3 only. ☆15 Updated 5 months ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 Updated 2 years ago
- dbt (data build tool) adapter for Oracle Autonomous Database ☆50 Updated 2 weeks ago
- Repository containing various utils related to Snowflake migration at Faire. ☆12 Updated 2 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 Updated 5 years ago
- A template for dockerized dbt-Core projects with VS Code Dev Containers. ☆20 Updated 2 years ago
- Example Set up For DBT Cloud using Github Integrations ☆11 Updated 4 years ago
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb ☆20 Updated last year