List of entity resolution software and resources.
☆126 · Mar 24, 2026 · Updated 2 months ago
Alternatives and similar repositories for Awesome-Entity-Resolution
Users that are interested in Awesome-Entity-Resolution are comparing it to the libraries listed below.
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python. ☆20 · Mar 9, 2026 · Updated 3 months ago
- An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms. ☆14 · Updated this week
- An End-to-End Evaluation Framework for Entity Resolution Systems ☆37 · Dec 3, 2023 · Updated 2 years ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆60 · Jun 10, 2021 · Updated 5 years ago
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends ☆2,202 · Jun 12, 2026 · Updated last week
- Similarity and distance measures for clustering and record linkage applications in R ☆19 · Sep 23, 2025 · Updated 8 months ago
- ☆15 · Aug 11, 2022 · Updated 3 years ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut… ☆161 · Nov 18, 2022 · Updated 3 years ago
- Scalable master data management, identity resolution, entity resolution, and deduplication using ML ☆1,214 · Updated this week
- Prototype record matching database. ☆26 · Jun 12, 2026 · Updated last week
- Perform Bayesian record linkage with a one-to-one matching assumption. ☆11 · Jul 9, 2020 · Updated 5 years ago
- A maximum-strength name parser for record linkage. ☆41 · Sep 3, 2025 · Updated 9 months ago
- Clustering and Link Prediction Evaluation in R ☆15 · Sep 23, 2023 · Updated 2 years ago
- Permutation Test for Regression Discontinuity and Regression Kink Designs ☆13 · Dec 31, 2023 · Updated 2 years ago
- Efficient String Comparison Functions and Fuzzy String Matching ☆20 · Sep 21, 2025 · Updated 8 months ago
- The SQL/Ibis powered sklearn of record linkage ☆24 · Jun 12, 2026 · Updated last week
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution. ☆4,479 · Jul 29, 2025 · Updated 10 months ago
- Lightweight validation tool for checking function arguments and data analysis scripts. ☆12 · Dec 24, 2024 · Updated last year
- Code for the paper "Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching" (COLING 2025) ☆19 · May 27, 2026 · Updated 3 weeks ago
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning ☆140 · Feb 15, 2026 · Updated 4 months ago
- SAE Unit/area Models and Methods for Estimation in R ☆26 · May 10, 2026 · Updated last month
- Enhance SSR for Python ☆14 · May 7, 2024 · Updated 2 years ago
- Repo for MGraph project ☆13 · Jan 10, 2026 · Updated 5 months ago
- Fast small desktop and web application designed to provide a visual interactive representation of RDF (Resource Description Framework) da… ☆46 · Jun 11, 2026 · Updated last week
- Demo of pointblank / projmgr / GitHub Actions / Slack workflow for data quality monitoring ☆17 · Mar 29, 2023 · Updated 3 years ago
- Hebrew oriented NER spaCy pipeline ☆20 · Aug 8, 2024 · Updated last year
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu ☆73 · Sep 17, 2025 · Updated 9 months ago
- R package for fast bulk imports/exports from/to SQL Server with the bcp command line utility ☆18 · Sep 6, 2025 · Updated 9 months ago
- A browser user interface for manual labeling of record pairs. ☆48 · Jun 23, 2023 · Updated 2 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆67 · Jun 9, 2026 · Updated last week
- A public API to get CRAN package check status. Updated once daily. #rstats ☆24 · Updated this week
- Documentation for the dash-extensions library ☆11 · Aug 6, 2025 · Updated 10 months ago
- Lightweight Python wrapper around the DuckDB extension, httpserver (extension developed by @quackscience) ☆17 · Sep 24, 2025 · Updated 8 months ago
- Data for the Chat With Your Data benchmark. ☆153 · Dec 1, 2023 · Updated 2 years ago
- Sparkling Water for R ☆13 · Sep 1, 2017 · Updated 8 years ago
- This repository houses the dash building blocks website. ☆31 · Feb 22, 2025 · Updated last year
- Cube Schema ☆14 · Jun 9, 2026 · Updated last week
- I will be adding different kind of opensource data extraction tools code using python ☆10 · Nov 15, 2024 · Updated last year
- Tutorial code and data for the entity resolution workshops. ☆45 · Jul 15, 2015 · Updated 10 years ago