List of entity resolution software and resources.
☆113Mar 24, 2026Updated this week
Alternatives and similar repositories for Awesome-Entity-Resolution
Users that are interested in Awesome-Entity-Resolution are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python.☆20Mar 9, 2026Updated 2 weeks ago
- ☆11Apr 2, 2021Updated 4 years ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆90Mar 22, 2026Updated last week
- An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms.☆14Mar 11, 2026Updated 2 weeks ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆36Dec 3, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Distributed Bayesian Entity Resolution in Apache Spark☆59Jun 10, 2021Updated 4 years ago
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends☆2,021Updated this week
- Similarity and distance measures for clustering and record linkage applications in R☆18Sep 23, 2025Updated 6 months ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆161Nov 18, 2022Updated 3 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,048Feb 21, 2024Updated 2 years ago
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML☆1,165Mar 14, 2026Updated 2 weeks ago
- This repository accompanies Hands On Entity Resolution by O'Reilly☆30Mar 1, 2024Updated 2 years ago
- A list of free data matching and record linkage software.☆400Feb 21, 2024Updated 2 years ago
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Jul 9, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Prototype record matching database.☆25Updated this week
- A maximum-strength name parser for record linkage.☆39Sep 3, 2025Updated 6 months ago
- Clustering and Link Prediction Evaluation in R☆14Sep 23, 2023Updated 2 years ago
- Efficient String Comparison Functions and Fuzzy String Matching☆20Sep 21, 2025Updated 6 months ago
- Lightweight validation tool for checking function arguments and data analysis scripts.☆12Dec 24, 2024Updated last year
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆135Feb 15, 2026Updated last month
- pseudopeople is a Python package that generates realistic simulated data about a fictional United States population, designed for use in …☆24Updated this week
- Fast small desktop and web application designed to provide a visual interactive representation of RDF (Resource Description Framework) da…☆40Feb 24, 2026Updated last month
- Enables loading react components in Dash applications directly from local project files, without any need for a separate build process.☆33Aug 19, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Demo of pointblank / projmgr / GitHub Actions / Slack workflow for data quality monitoring☆16Mar 29, 2023Updated 3 years ago
- ☆18Oct 16, 2023Updated 2 years ago
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu☆73Sep 17, 2025Updated 6 months ago
- R package for fast bulk imports/exports from/to SQL Server with the bcp command line utility☆18Sep 6, 2025Updated 6 months ago
- A browser user interface for manual labeling of record pairs.☆48Jun 23, 2023Updated 2 years ago
- AI Agent Tools library for Graphlit Platform☆20Jan 14, 2025Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆66Updated this week
- 本项目旨在分享大模型相关技术原理以及实战经验。☆12Sep 6, 2023Updated 2 years ago
- Lightweight Python wrapper around the DuckDB extension, httpserver (extension developed by @quackscience)☆17Sep 24, 2025Updated 6 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Data for the Chat With Your Data benchmark.☆148Dec 1, 2023Updated 2 years ago
- Sparkling Water for R☆13Sep 1, 2017Updated 8 years ago
- This repository houses the dash building blocks website.☆31Feb 22, 2025Updated last year
- Spatial Optimization for R☆45Mar 19, 2026Updated last week
- Cube Schema☆13Updated this week
- Fully unit tested utility functions for data engineering. Python 3 only.☆18Mar 12, 2026Updated 2 weeks ago
- Tutorial code and data for the entity resolution workshops.☆45Jul 15, 2015Updated 10 years ago