An open source, high scalability toolkit in Java for Entity Resolution.
☆225Jul 12, 2025Updated 11 months ago
Alternatives and similar repositories for JedAIToolkit
Users that are interested in JedAIToolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Minoan ER is an Entity Resolution (ER) framework, built by researchers in Crete (the land of the ancient Minoan civilization). Entity res…☆18Nov 18, 2020Updated 5 years ago
- A list of free data matching and record linkage software.☆406Feb 21, 2024Updated 2 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆66Mar 29, 2024Updated 2 years ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆96Mar 22, 2026Updated 3 months ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Dec 20, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆30Apr 5, 2023Updated 3 years ago
- Python package for performing Entity and Text Matching using Deep Learning.☆620Jun 18, 2024Updated 2 years ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆161Nov 18, 2022Updated 3 years ago
- Entity resolution for Elasticsearch.☆168Mar 1, 2026Updated 3 months ago
- ☆13Feb 25, 2022Updated 4 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆60Jun 10, 2021Updated 5 years ago
- Entity resolution using zero labeled examples☆33Jun 29, 2024Updated 2 years ago
- A simple command line interface to the datamade/dedupe library.☆43Dec 26, 2022Updated 3 years ago
- ☆32Sep 3, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆312Apr 17, 2024Updated 2 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆22Oct 18, 2021Updated 4 years ago
- Tutorial code and data for the entity resolution workshops.☆45Jul 15, 2015Updated 10 years ago
- ☆192May 29, 2024Updated 2 years ago
- Similarity and distance measures for clustering and record linkage applications in R☆19Sep 23, 2025Updated 9 months ago
- Code for the paper "CollaborEM: A Self-supervised Entity Matching Framework Using Multi-features Collaboration". TKDE 2021.☆41Jul 12, 2022Updated 3 years ago
- Duke is a fast and flexible deduplication engine written in Java☆623Oct 11, 2023Updated 2 years ago
- Record Linkage ToolKit (Find and link entities)☆112Aug 14, 2023Updated 2 years ago
- Read and query HDT documents with ease in Python☆13Mar 18, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Spark RDD with Lucene's query and entity linkage capabilities☆129Jun 23, 2026Updated last week
- Welcome to Snowman App – a Data Matching Benchmark Platform.☆38Feb 9, 2023Updated 3 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,485Jul 29, 2025Updated 11 months ago
- End-to-End Deep Entity Resolution☆33Jul 14, 2021Updated 4 years ago
- WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing,…☆114May 20, 2022Updated 4 years ago
- Record matching and entity resolution at scale in Spark☆36Oct 31, 2023Updated 2 years ago
- Collection of some algorithms for entity resolution☆28Sep 7, 2015Updated 10 years ago
- LEMON: Explainable Entity Matching☆19Apr 6, 2022Updated 4 years ago
- This repository contains the code and data download links to reproduce building the WDC Products Benchmark.☆15Jul 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Scalable master data management, identity resolution, entity resolution, and deduplication using ML☆1,226Updated this week
- The code of our AAAI'20 paper "GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks"☆11Aug 10, 2020Updated 5 years ago
- ☆32Apr 15, 2023Updated 3 years ago
- Demonstration of how dedupe might be used as geocoder☆17Jun 21, 2022Updated 4 years ago
- Resources for PVLDB 2023 submission☆28Aug 28, 2024Updated last year
- Ontology-driven Linked Data processor and server for SPARQL backends. Apache License.☆66Jul 10, 2023Updated 2 years ago
- Silk Linked Data Integration Framework☆250Updated this week