Valires / er-evaluation
An End-to-End Evaluation Framework for Entity Resolution Systems
☆25Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for er-evaluation
- Efficient String Comparison Functions and Fuzzy String Matching☆17Updated 2 years ago
- Python package for deduplication/entity resolution using active learning☆79Updated 2 months ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆30Updated last year
- A very simple library for exploiting graph-of-words in NLP☆12Updated 3 years ago
- Bag of, not words, but tricks!☆68Updated last year
- Record matching and entity resolution at scale in Spark☆31Updated last year
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆17Updated 3 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 3 years ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆17Updated 2 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆16Updated 2 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆84Updated 2 years ago
- List of entity resolution software and resources.☆35Updated 8 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆137Updated 3 weeks ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆45Updated 6 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆56Updated last year
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆147Updated last year
- Pipeline components that support partial_fit.☆43Updated 3 months ago
- ☆29Updated 2 years ago
- Knowledge Graph Extension for Python - Team Project 2020 @ Uni Mannheim☆76Updated 2 years ago
- ☆32Updated 3 years ago
- Entity resolution using zero labeled examples☆26Updated 4 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- spaCy match and replace, maintaining conjugation☆34Updated last year
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆137Updated 3 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆152Updated 2 years ago
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.☆71Updated this week
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆91Updated last year
- Tutorial and hands-on notebook on using the Knowledge Graph Toolkit (KGTK)☆78Updated 2 years ago
- ☆53Updated 10 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 9 months ago