Valires / er-evaluation
An End-to-End Evaluation Framework for Entity Resolution Systems
☆26 Updated 11 months ago
Related projects
Alternatives and complementary repositories for er-evaluation
- Efficient String Comparison Functions and Fuzzy String Matching ☆17 Updated 2 years ago
- ☄️ Parallel and distributed training with spaCy and Ray ☆54 Updated last year
- Python package for deduplication/entity resolution using active learning ☆78 Updated 2 months ago
- Bag of, not words, but tricks! ☆68 Updated last year
- List of entity resolution software and resources. ☆38 Updated 8 months ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies ☆17 Updated 2 years ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 Updated 3 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching ☆139 Updated last month
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o… ☆45 Updated 6 years ago
- Generate reports for spaCy models. ☆28 Updated 2 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆85 Updated 2 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 Updated 7 months ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆16 Updated 2 years ago
- ☆67 Updated 2 years ago
- A browser user interface for manual labeling of record pairs. ☆41 Updated last year
- ☆29 Updated 2 years ago
- A very simple library for exploiting graph-of-words in NLP ☆12 Updated 3 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space … ☆30 Updated last year
- Record matching and entity resolution at scale in Spark ☆31 Updated last year
- Entity resolution using zero labeled examples ☆26 Updated 4 months ago
- A Python library aimed at dissecting and augmenting NER training data. ☆56 Updated last year
- spaCy match and replace, maintaining conjugation ☆34 Updated last year
- Notebooks configured to be run with Binder, usually found on my blog. ☆41 Updated last year
- 🧪 Cutting-edge experimental spaCy components and features ☆95 Updated 6 months ago
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets. ☆59 Updated this week
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021 ☆17 Updated 3 years ago
- 📚 Process PDFs, Word documents and more with spaCy ☆75 Updated this week
- Pipeline components that support partial_fit. ☆43 Updated 4 months ago
- Dataframe Integration with spaCy. ☆101 Updated 3 years ago
- A tool for quickly adding labels to unlabeled datasets ☆20 Updated 10 months ago