moj-analytical-services / splink_graphLinks
pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other domains)
☆10Updated last year
Alternatives and similar repositories for splink_graph
Users that are interested in splink_graph are comparing it to the libraries listed below
Sorting:
- Build your feature store with macros right within your dbt repository☆39Updated 2 years ago
- ☆12Updated 4 years ago
- Example Multi-Cycle, Multi-Touch Revenue and Cost Attribution Model☆27Updated last year
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- A repository for the best data content, from data science to data engineering☆21Updated last month
- CLI for data platform☆19Updated last year
- Automate and streamline the alerting & notification process for dbt test results🐞🚀☆17Updated last month
- A DBT adapter for iomete☆12Updated 2 months ago
- Repository containing various utils related to Snowflake migration at Faire.☆12Updated 2 years ago
- MOVED TO GITLAB. A list/directory of awesome/helpful Looker and LookML work.☆20Updated 4 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 6 years ago
- A dbt package designed to help SQL based analysis of graphs☆21Updated 2 years ago
- SQL query executor on remote DuckDB instance using Apache Arrow Flight RPC through Streamlit Web interface.☆15Updated 8 months ago
- ☆20Updated last year
- This project is wraper for Leilex, legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆24Updated 9 months ago
- An infrastructure as code approach to deploying Snowflake using Terraform☆26Updated 2 years ago
- lookML block for user journeys based on events☆16Updated 4 years ago
- Update a Google Data Catalog tag with dbt Cloud run metadata☆22Updated 4 years ago
- Demo repository to lambda-fy your dbt runs☆11Updated last year
- ☆13Updated 2 weeks ago
- Styles for dbt on the net☆10Updated 7 months ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 4 years ago
- Python Package to manage and perform API requests using the Adobe v2 API.☆13Updated 2 years ago
- Collection of utility scripts to extract code so it can be upgraded to SnowFlake using the SnowConvert tool.☆20Updated last week
- Fully unit tested utility functions for data engineering. Python 3 only.☆17Updated 10 months ago
- Example Set up For DBT Cloud using Github Integrations☆11Updated 5 years ago
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Updated 5 years ago
- A serverless duckDB deployment at GCP☆40Updated 2 years ago
- dbt-core-interface is an MIT licensed high level wrapper for dbt-core that can be used to drive third party integrations such as servers,…☆35Updated last week
- dbt-generator - Generate and transform base models for dbt project☆47Updated 2 years ago