OlivierBinette / er-evaluation
An End-to-End Evaluation Framework for Entity Resolution Systems
☆26Updated last year
Alternatives and similar repositories for er-evaluation:
Users that are interested in er-evaluation are comparing it to the libraries listed below
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 3 years ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Continuous Benchmark of Filtering methods for Entity Resolution☆9Updated 7 months ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆85Updated 2 years ago
- Entity resolution using zero labeled examples☆28Updated 7 months ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆31Updated last year
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆46Updated 6 years ago
- ☆30Updated 2 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated last year
- Bag of, not words, but tricks!☆68Updated last year
- ☆54Updated last year
- List of entity resolution software and resources.☆56Updated 11 months ago
- Python package for deduplication/entity resolution using active learning☆76Updated 5 months ago
- Pipeline components that support partial_fit.☆45Updated 7 months ago
- Generate reports for spaCy models.☆29Updated 2 years ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆148Updated 2 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆141Updated 4 months ago
- A very simple library for exploiting graph-of-words in NLP☆12Updated 3 years ago
- ☄️ Parallel and distributed training with spaCy and Ray☆53Updated last year
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆22Updated 2 years ago
- MinHash implementation in Python☆11Updated 5 months ago
- Interactive notebooks containing demonstration code of the splink library☆37Updated last year
- ☆15Updated 2 years ago
- Fast, flexible name matching for large datasets☆70Updated last year
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- A browser user interface for manual labeling of record pairs.☆44Updated last year
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆18Updated 3 years ago
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)☆14Updated 4 months ago
- Tutorial code and data for the entity resolution workshops.☆43Updated 9 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆63Updated 10 months ago