OlivierBinette / er-evaluation
An End-to-End Evaluation Framework for Entity Resolution Systems
☆29 Updated last year
Alternatives and similar repositories for er-evaluation
Users who are interested in er-evaluation are comparing it to the libraries listed below.
- Efficient String Comparison Functions and Fuzzy String Matching ☆17 Updated 3 years ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 Updated 4 years ago
- Continuous Benchmark of Filtering methods for Entity Resolution ☆10 Updated 11 months ago
- Bag of, not words, but tricks! ☆68 Updated last year
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o… ☆47 Updated 7 years ago
- A very simple library for exploiting graph-of-words in NLP ☆12 Updated 4 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space … ☆32 Updated 2 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆17 Updated 3 years ago
- Python package for deduplication/entity resolution using active learning ☆80 Updated 10 months ago
- Record matching and entity resolution at scale in Spark ☆34 Updated last year
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond… ☆22 Updated 3 years ago
- Entity resolution using zero labeled examples ☆28 Updated 11 months ago
- Generate reports for spaCy models. ☆29 Updated 3 years ago
- Pipeline components that support partial_fit. ☆46 Updated 11 months ago
- ☆55 Updated last year
- An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows. ☆78 Updated last month
- ☄️ Parallel and distributed training with spaCy and Ray ☆54 Updated last year
- ☆30 Updated 3 years ago
- Notebooks configured to be run with Binder, usually found on my blog. ☆42 Updated 2 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching ☆144 Updated 8 months ago
- Interactive notebooks containing demonstration code of the splink library ☆38 Updated last year
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆85 Updated 2 years ago
- MinHash implementation in Python ☆11 Updated 10 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆94 Updated 2 years ago
- Easy PDF to text to spaCy text extraction in Python. ☆39 Updated 8 months ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python ☆140 Updated 11 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 Updated last year
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies ☆19 Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor ☆29 Updated 5 months ago
- Super Simple Similarities Service ☆148 Updated 2 months ago