An End-to-End Evaluation Framework for Entity Resolution Systems
☆36Dec 3, 2023Updated 2 years ago
Alternatives and similar repositories for er-evaluation
Users that are interested in er-evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient String Comparison Functions and Fuzzy String Matching☆20Sep 21, 2025Updated 6 months ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆161Nov 18, 2022Updated 3 years ago
- pseudopeople is a Python package that generates realistic simulated data about a fictional United States population, designed for use in …☆24Mar 25, 2026Updated 3 weeks ago
- ☆11Apr 2, 2021Updated 5 years ago
- An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms.☆14Apr 9, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Perform Bayesian record linkage with a one-to-one matching assumption.☆11Jul 9, 2020Updated 5 years ago
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python.☆20Mar 9, 2026Updated last month
- Clustering and Link Prediction Evaluation in R☆14Sep 23, 2023Updated 2 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆31Apr 5, 2023Updated 3 years ago
- Continuous Benchmark of Filtering methods for Entity Resolution☆11Jul 20, 2025Updated 8 months ago
- Similarity and distance measures for clustering and record linkage applications in R☆18Sep 23, 2025Updated 6 months ago
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆137Feb 15, 2026Updated 2 months ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆21Oct 18, 2021Updated 4 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Aug 20, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- R package for fast bulk imports/exports from/to SQL Server with the bcp command line utility☆18Sep 6, 2025Updated 7 months ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆92Mar 22, 2026Updated 3 weeks ago
- Learned string similarity for entity names using optimal transport.☆35Nov 17, 2020Updated 5 years ago
- An R package that returns tidy data from the World Prison Brief website.☆17Feb 14, 2021Updated 5 years ago
- Creating Debian Packages from CRAN Sources☆12Jul 1, 2020Updated 5 years ago
- Create country-year/month/day panels consistent with the COW or Gleditsch & Ward independent states lists☆14Aug 25, 2025Updated 7 months ago
- Task based code snippet examples for Senzing V3.☆16Apr 10, 2026Updated last week
- BisPy - Python bisimulation library☆16Jan 21, 2022Updated 4 years ago
- Parent repository for the MOJ Analytics Platform☆14Nov 16, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Files and instructions for unattended/automatic setup of a Raspberry Pi using only the boot partition which you can see on a flashed SD c…☆14Aug 7, 2021Updated 4 years ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆24May 31, 2022Updated 3 years ago
- A maximum-strength name parser for record linkage.☆40Sep 3, 2025Updated 7 months ago
- Python wrapper for a C++ Double Metaphone☆15Jan 12, 2026Updated 3 months ago
- Python-Markdown plugin for image captions☆12May 24, 2023Updated 2 years ago
- R code for common, repeatable data wrangling and analysis of SafeGraph data☆17Nov 20, 2022Updated 3 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Dec 12, 2021Updated 4 years ago
- Bluetooth Indoor Positioning with DNNs☆13Mar 28, 2022Updated 4 years ago
- DuckDB Engine as Google Sheets Library☆20Dec 14, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- catchr: Flexible, useful tools for dealing with conditions in R, for new users and veterans☆17Sep 25, 2021Updated 4 years ago
- UI for JedAI Toolkit☆17May 20, 2022Updated 3 years ago
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆308Apr 17, 2024Updated 2 years ago
- Multi-task model for named-entity recognition, relation extraction, entity mention detection and coreference resolution.☆46Jun 26, 2024Updated last year
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML☆1,180Mar 27, 2026Updated 3 weeks ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- Source code for "A Critical Re-evaluation of Neural Methods for Entity Alignment"☆16Oct 4, 2022Updated 3 years ago