moj-analytical-services/splink_demos

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/moj-analytical-services/splink_demos)

moj-analytical-services / splink_demos

Interactive notebooks containing demonstration code of the splink library

☆41

Alternatives and similar repositories for splink_demos

Users that are interested in splink_demos are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

moj-analytical-services / etl_manager
View on GitHub
A python package to create a database on the platform using our moj data warehousing framework
☆21Mar 16, 2026Updated last month
dedupeio / doublemetaphone
View on GitHub
Python wrapper for a C++ Double Metaphone
☆15Jan 12, 2026Updated 3 months ago
moj-analytical-services / laurium
View on GitHub
Extract structured data from free text using large language models
☆19Updated this week
uktrade / matchbox
View on GitHub
Prototype record matching database.
☆25Apr 16, 2026Updated last week
moj-analytical-services / airflow-pdf2embeddings
View on GitHub
NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to …
☆37Jun 22, 2022Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
NickCrews / mismo
View on GitHub
The SQL/Ibis powered sklearn of record linkage
☆23Apr 19, 2026Updated last week
J535D165 / data-matching-software
View on GitHub
A list of free data matching and record linkage software.
☆403Feb 21, 2024Updated 2 years ago
datasciencecampus / pprl_toolkit
View on GitHub
The privacy-preserving record linkage toolkit: a proof-of-concept public demo of next-gen data linkage techniques.
☆16May 22, 2024Updated last year
ncn-foreigners / BlockingPy
View on GitHub
Blocking records for record linkage and data deduplication based on ANN algorithms in Python.
☆20Mar 9, 2026Updated last month
ronald-smith-angel / owl-data-sanitizer
View on GitHub
A pyspark lib to validate data quality
☆19Nov 11, 2022Updated 3 years ago
rstudio / rmdexamples
View on GitHub
☆21Dec 19, 2019Updated 6 years ago
cleanzr / dblink
View on GitHub
Distributed Bayesian Entity Resolution in Apache Spark
☆60Jun 10, 2021Updated 4 years ago
ONSdigital / gptables
View on GitHub
Good Practice Tables - an XlsxWriter wrapper to write consistently formatted statistical tables to Excel.
☆45Nov 13, 2025Updated 5 months ago
cleanzr / record-linkage-tutorial
View on GitHub
A tutorial on entity resolution (record linkage or de-duplication)
☆65Jun 30, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
J535D165 / recordlinkage
View on GitHub
A powerful and modular toolkit for record linkage and duplicate detection in Python
☆1,048Feb 21, 2024Updated 2 years ago
huggingface / datasets-tagging
View on GitHub
A Streamlit app to add structured tags to a dataset card
☆22Jun 30, 2022Updated 3 years ago
meccaLeccaHi / record_linkage
View on GitHub
Probabilistic/machine-learning algorithms for medical record linkage [Critical Juncture]
☆15Oct 6, 2017Updated 8 years ago
r-builder / cran2deb
View on GitHub
Creating Debian Packages from CRAN Sources
☆12Jul 1, 2020Updated 5 years ago
joelparkerhenderson / social-value-orientation
View on GitHub
Social value orientation (SVO) notes for pro-social pro-self concepts
☆13Apr 14, 2025Updated last year
Mik3M4n / BaCoN
View on GitHub
BAyesian COsmological Network - a bayesian neural network for classification of dark energy/modified gravity power spectra
☆14Dec 8, 2022Updated 3 years ago
aws-samples / amazon-sagemaker-studio-secure-sso
View on GitHub
This solution provides a way to deploy SageMaker Studio in a private and secure environment. The solution integrates with a Custom SAML 2…
☆14Apr 11, 2023Updated 3 years ago
aws-samples / users-and-team-management-with-amazon-sagemaker-and-aws-sso
View on GitHub
☆15May 10, 2023Updated 2 years ago
aws-samples / sagemaker-experiments-and-pipelines
View on GitHub
☆11Feb 15, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
nhennetier / pyeconometrics
View on GitHub
Python package for panel data econometrics
☆15Jun 5, 2018Updated 7 years ago
scravy / pysparkextra
View on GitHub
☆10Jun 29, 2021Updated 4 years ago
nikhilmishradevelop / zindi-airqo
View on GitHub
☆10Jul 1, 2020Updated 5 years ago
ropeladder / record-linkage-resources
View on GitHub
Resources for tackling record linkage / deduplication / data matching problems
☆127Feb 22, 2024Updated 2 years ago
WillKoehrsen / data-structures-and-algorithms-coursera
View on GitHub
Materials and assignments for Coursera Course: data structures and algorithms
☆15Sep 5, 2019Updated 6 years ago
rajat5ranjan / AV-Innoplexus-Online-Hiring-Hackathon-Sentiment-Analysis
View on GitHub
Innoplexus Online Hiring Hackathon: Sentiment Analysis organised by Analytics Vidya
☆11Jul 29, 2019Updated 6 years ago
kunalj101 / Innoplexus_sentiment_analysis_top_solutions
View on GitHub
Winners' code for Innoplexus Sentiment Analysis Hackathon: https://datahack.analyticsvidhya.com/contest/innoplexus-online-hiring-hackatho…
☆15Dec 8, 2022Updated 3 years ago
CalebEmelike / 3rd-Place-Solution-on-MachineHack-Video-Game-Sales-Prediction
View on GitHub
The gaming industry is certainly one of the thriving industries of the modern age and one of those that are most influenced by the advanc…
☆12Jun 29, 2020Updated 5 years ago
ZiCog / xoroshiro
View on GitHub
the xoroshiro32++ and xoroshiro64++ PRNG algorthims by David Blackman and Sebastiano Vigna in C++, Verilog, VHDL and SpinalHDL.
☆16Dec 2, 2018Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
mfasiolo / sc2-2019
View on GitHub
Source code of Statistical Computing 2 website
☆12Feb 6, 2023Updated 3 years ago
JuliaGraphs / CommunityDetection.jl
View on GitHub
Community Detection algorithms for LightGraphs
☆15Mar 12, 2026Updated last month
sutugin / spark-streaming-jdbc-source
View on GitHub
☆26Apr 15, 2021Updated 5 years ago
aws-samples / amazon-sagemaker-studio-vpc-blog
View on GitHub
☆14Oct 28, 2021Updated 4 years ago
tslocz / hettreatreg
View on GitHub
OLS Weights on Heterogeneous Treatment Effects
☆10Jun 15, 2020Updated 5 years ago
velascoluis / serverless-duckdb
View on GitHub
A serverless duckDB deployment at GCP
☆41Aug 30, 2022Updated 3 years ago
J535D165 / recordlinkage-annotator
View on GitHub
A browser user interface for manual labeling of record pairs.
☆48Jun 23, 2023Updated 2 years ago