A maximum-strength name parser for record linkage.
☆39Sep 3, 2025Updated 6 months ago
Alternatives and similar repositories for nominally
Users that are interested in nominally are comparing it to the libraries listed below
Sorting:
- Python wrapper for a C++ Double Metaphone☆15Jan 12, 2026Updated last month
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆66Feb 24, 2026Updated last week
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆89Nov 3, 2025Updated 4 months ago
- CLK hash: hash pii for entity matching☆48May 12, 2025Updated 9 months ago
- Interactive notebooks containing demonstration code of the splink library☆40Updated this week
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Jul 9, 2020Updated 5 years ago
- Introduction to git for social science students (not software developers)☆11Apr 15, 2019Updated 6 years ago
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python.☆19Nov 28, 2025Updated 3 months ago
- A browser user interface for manual labeling of record pairs.☆48Jun 23, 2023Updated 2 years ago
- ☆31Updated this week
- A list of free data matching and record linkage software.☆401Feb 21, 2024Updated 2 years ago
- Official Python SDK for Minds☆13Jan 28, 2026Updated last month
- financial analysis that has been behind the moat of "Wall St" for years, opened up to everybody. Simple investment strategies with comple…☆15Oct 15, 2025Updated 4 months ago
- ☆14Nov 15, 2025Updated 3 months ago
- ☆21Dec 19, 2019Updated 6 years ago
- Various presentations I've given on things.☆14Jan 25, 2022Updated 4 years ago
- ☆16Updated this week
- Text Processing & Segmentation Framework☆27Sep 18, 2025Updated 5 months ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,046Feb 21, 2024Updated 2 years ago
- The SQL/Ibis powered sklearn of record linkage☆24Feb 9, 2026Updated 3 weeks ago
- Tools for Managing Survey Data, Creating Tables of Estimates and Data Summaries☆47Aug 12, 2025Updated 6 months ago
- Rust crate for entity parsing☆18Dec 26, 2022Updated 3 years ago
- Deduplicate and parse list of `dirty names'☆23Nov 4, 2020Updated 5 years ago
- SenateTrades: what stocks are your senators buying?☆36Jun 29, 2022Updated 3 years ago
- pseudopeople is a Python package that generates realistic simulated data about a fictional United States population, designed for use in …☆24Feb 20, 2026Updated last week
- Find and replace erroneous fields in data using validation rules☆22Dec 10, 2025Updated 2 months ago
- Daily TV News Summary using GPT☆24May 16, 2025Updated 9 months ago
- Omnipy is a high level Python library for type-driven data wrangling and scalable workflow orchestration (under development)☆25Updated this week
- Data from Google's Covid-19 Mobility Report☆48Apr 17, 2020Updated 5 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆59Jun 10, 2021Updated 4 years ago
- A simple Python module for parsing human names into their individual components☆702May 28, 2024Updated last year
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar real…☆32Apr 8, 2025Updated 10 months ago
- PostgreSQL extension for vector search, embeddings, and ML, plus NeuronAgent runtime and NeuronMCP server.☆41Feb 3, 2026Updated last month
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends☆1,980Updated this week
- Entity resolution for Elasticsearch.☆166Updated this week
- Random xkcd comic in a JupyterLab panel☆31Aug 8, 2023Updated 2 years ago
- a python library for parsing unstructured western names into name components.☆616May 15, 2025Updated 9 months ago
- This is a ggplot2 geom for plotting and comparing the ROC curves☆10Jul 6, 2016Updated 9 years ago
- A light-weight wrapper for the Datawrapper API.☆89Updated this week