mjvallone / phruzz-matcher
Combination of the RapidFuzz library with Spacy PhraseMatcher
☆11Updated 3 years ago
Alternatives and similar repositories for phruzz-matcher:
Users that are interested in phruzz-matcher are comparing it to the libraries listed below
- Repository hosting the common code for the entity-fishing clients☆10Updated 11 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆161Updated 2 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆17Updated 8 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated 2 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated last year
- Generate reports for spaCy models.☆29Updated 2 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- ☆32Updated 2 years ago
- ☆54Updated last year
- ☆30Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- An easy-to-use library to extract indices from texts.☆29Updated 3 years ago
- A collection of notebooks for Natural Language Processing☆25Updated 3 months ago
- Libraries, Archives and Museums (LAM)☆82Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆16Updated last month
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆85Updated 2 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 9 months ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- Named entity annotation tool☆28Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 3 years ago
- communication sur le moteur de pseudonymisation de la Cour de Cassation☆18Updated 2 years ago
- Finds linguistic patterns effortlessly☆36Updated last year