An open source, high scalability toolkit in Java for Entity Resolution.
☆224Jul 12, 2025Updated 10 months ago
Alternatives and similar repositories for JedAIToolkit
Users that are interested in JedAIToolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- UI for JedAI Toolkit☆17May 20, 2022Updated 4 years ago
- Minoan ER is an Entity Resolution (ER) framework, built by researchers in Crete (the land of the ancient Minoan civilization). Entity res…☆18Nov 18, 2020Updated 5 years ago
- A list of free data matching and record linkage software.☆406Feb 21, 2024Updated 2 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆66Mar 29, 2024Updated 2 years ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆95Mar 22, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Dec 20, 2021Updated 4 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆30Apr 5, 2023Updated 3 years ago
- Python package for performing Entity and Text Matching using Deep Learning.☆619Jun 18, 2024Updated last year
- ☆18Jun 17, 2024Updated last year
- Continuous Benchmark of Filtering methods for Entity Resolution☆11Jul 20, 2025Updated 10 months ago
- Entity resolution for Elasticsearch.☆167Mar 1, 2026Updated 3 months ago
- Distributed Bayesian Entity Resolution in Apache Spark☆60Jun 10, 2021Updated 5 years ago
- ☆32Sep 3, 2021Updated 4 years ago
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆311Apr 17, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆192May 29, 2024Updated 2 years ago
- Code for the paper "CollaborEM: A Self-supervised Entity Matching Framework Using Multi-features Collaboration". TKDE 2021.☆41Jul 12, 2022Updated 3 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆127Feb 22, 2024Updated 2 years ago
- Duke is a fast and flexible deduplication engine written in Java☆623Oct 11, 2023Updated 2 years ago
- Record Linkage ToolKit (Find and link entities)☆112Aug 14, 2023Updated 2 years ago
- Read and query HDT documents with ease in Python☆13Mar 18, 2020Updated 6 years ago
- Clustering and Link Prediction Evaluation in R☆15Sep 23, 2023Updated 2 years ago
- Spark RDD with Lucene's query and entity linkage capabilities☆129Apr 30, 2026Updated last month
- Welcome to Snowman App – a Data Matching Benchmark Platform.☆38Feb 9, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing,…☆114May 20, 2022Updated 4 years ago
- Record matching and entity resolution at scale in Spark☆36Oct 31, 2023Updated 2 years ago
- Collection of some algorithms for entity resolution☆28Sep 7, 2015Updated 10 years ago
- Stanford Entity-Resolution Framework☆24Jun 23, 2018Updated 7 years ago
- LEMON: Explainable Entity Matching☆19Apr 6, 2022Updated 4 years ago
- This repository contains the code and data download links to reproduce building the WDC Products Benchmark.☆15Jul 13, 2023Updated 2 years ago
- Scalable master data management, identity resolution, entity resolution, and deduplication using ML☆1,210Updated this week
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends☆2,192Updated this week
- ☆32Apr 15, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆47May 11, 2018Updated 8 years ago
- Demonstration of how dedupe might be used as geocoder☆17Jun 21, 2022Updated 3 years ago
- Resources for PVLDB 2023 submission☆28Aug 28, 2024Updated last year
- Ontology-driven Linked Data processor and server for SPARQL backends. Apache License.☆66Jul 10, 2023Updated 2 years ago
- Silk Linked Data Integration Framework☆251Updated this week
- 🕸 YALC: Yet Another LOD Cloud (registry of Linked Open Datasets).☆15Aug 21, 2023Updated 2 years ago
- atyimo: probabilistic record linkage for massive administrative datasets☆10Jan 23, 2019Updated 7 years ago