marijnkoolen / fuzzy-searchLinks
Fuzzy search modules for searching lists of words in low quality OCR and HTR text.
☆22Updated this week
Alternatives and similar repositories for fuzzy-search
Users that are interested in fuzzy-search are comparing it to the libraries listed below
Sorting:
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 6 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆31Updated 3 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆27Updated 3 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆28Updated 4 years ago
- A Python library for topic modeling and visualization☆67Updated 5 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆150Updated last year
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆37Updated 6 years ago
- Detect and align similar passages☆116Updated 4 months ago
- Named entity annotation tool☆28Updated 2 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 3 years ago
- ☆28Updated 5 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Updated 3 years ago
- Named Entity Recognition☆18Updated 9 months ago
- CERberus -- guardian against character errors☆29Updated last year
- Named Entity Disambiguation and Linking☆16Updated last year
- ☆33Updated 3 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆96Updated 2 years ago
- UIMA CAS processing library written in Python☆91Updated 2 months ago
- ☆39Updated last year
- Bias correction for richness in abundance data☆12Updated 5 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 5 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Updated 2 years ago
- Python API for KB data-services☆19Updated 6 years ago
- How About Machine Learning Enhancing Theses? - a pilot discovery project☆14Updated 2 years ago
- Project on the history of genre.☆24Updated 5 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Updated last month
- Preliminary spaCy models for Latin☆14Updated 3 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Updated 3 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆87Updated 3 years ago
- Text Re-use Alignment Visualization☆38Updated 8 years ago