moj-analytical-services / splink_graph
pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other domains)
☆10Updated 10 months ago
Related projects: ⓘ
- ☆12Updated 3 years ago
- A dbt package designed to help SQL based analysis of graphs☆20Updated last year
- Build your feature store with macros right within your dbt repository☆37Updated last year
- Example Multi-Cycle, Multi-Touch Revenue and Cost Attribution Model☆19Updated 7 months ago
- ☆26Updated this week
- ☆11Updated last year
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 3 years ago
- ☆19Updated last year
- dbt-generator - Generate and transform base models for dbt project☆44Updated last year
- Department of Education (DOE) for New South Wales (AUS) data stack in a box☆19Updated 3 weeks ago
- An experimental Athena extension for DuckDB 🐤☆49Updated 7 months ago
- 📦 Example repository showing how to use dbt inside Visual Studio Code development containers☆39Updated last year
- MOVED TO GITLAB. A list/directory of awesome/helpful Looker and LookML work.☆19Updated 3 years ago
- An "ERD for LookML". Now available for download on the Looker Marketplace.☆15Updated this week
- Update a Google Data Catalog tag with dbt Cloud run metadata☆21Updated 3 years ago
- dbt package mimicking dplyr select-helpers semantics☆137Updated 3 weeks ago
- ☆14Updated last year
- Data-aware orchestration with dagster, dbt, and airbyte☆29Updated last year
- A serverless duckDB deployment at GCP☆34Updated 2 years ago
- Sophisticated alerting block for looker built in Lookml☆15Updated 3 years ago
- DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, S…☆12Updated 2 months ago
- Linear regression in SQL using dbt☆64Updated 3 weeks ago
- A template for dockerized dbt-Core projects with VS Code Dev Containers.☆18Updated last year
- A repository for the best data content, from data science to data engineering☆19Updated last month
- Interactive notebooks containing demonstration code of the splink library☆38Updated 8 months ago
- Activity Schema dbt package☆14Updated 10 months ago
- [DEPRECATED] A dbt adapter for Excel.☆89Updated last year
- A dbt-Core package for generating models from an activity stream.☆38Updated 5 months ago
- dbt starter code for enterprise Snowflake usage data artifacts☆20Updated 2 years ago
- Efficient String Comparison Functions and Fuzzy String Matching☆17Updated 2 years ago