anhaidgroup / py_stringmatching
A comprehensive and scalable set of string tokenizers and similarity measures in Python
☆136Updated 6 months ago
Alternatives and similar repositories for py_stringmatching:
Users that are interested in py_stringmatching are comparing it to the libraries listed below
- ☆188Updated 8 months ago
- Scalable String Similarity Joins in Python☆38Updated 7 months ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆46Updated 6 years ago
- Abydos NLP/IR library for Python☆184Updated 2 years ago
- Record Linkage ToolKit (Find and link entities)☆108Updated last year
- Python package for performing Entity and Text Matching using Deep Learning.☆574Updated 7 months ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆148Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.☆254Updated 7 months ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆26Updated last year
- Dataframe Integration with spaCy.☆103Updated 3 years ago
- Self-Supervision for Named Entity Disambiguation at the Tail☆215Updated 2 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆141Updated 3 months ago
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆268Updated 9 months ago
- ☆32Updated 3 years ago
- A machine learning tool for fishing entities☆257Updated this week
- Running Prodigy for a team of annotators☆53Updated 4 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆85Updated 2 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- PYthon Automated Term Extraction☆309Updated 2 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated last year
- Extraction Toolkit☆82Updated 3 years ago
- FlexMatcher is a schema matching package in Python which handles the problem of matching multiple schemas to a single mediated schema.☆29Updated 2 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆157Updated 2 years ago
- Tutorial code and data for the entity resolution workshops.☆43Updated 9 years ago
- ☆70Updated 2 years ago
- For extracting measurements and related entities from text☆57Updated 4 years ago
- Python library for Natural Language Preprocessing (NLPre)☆190Updated last year
- A spaCy wrapper for DBpedia Spotlight☆107Updated last year
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 4 months ago
- Python package for deduplication/entity resolution using active learning☆76Updated 5 months ago