marijnkoolen / fuzzy-search
Fuzzy search modules for searching lists of words in low quality OCR and HTR text.
☆20Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for fuzzy-search
- Citation Classification using hybrid neural network model for Wikipedia References☆28Updated last year
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- Bias correction for richness in abundance data☆9Updated 4 months ago
- Named entity annotation tool☆27Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated last year
- Named Entity Disambiguation and Linking☆14Updated 5 months ago
- A deep learning architecture for reference mining from literature in the arts and humanities.☆15Updated 5 years ago
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 2 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆22Updated last year
- Named Entity Recognition☆16Updated this week
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 3 years ago
- ☆28Updated 3 years ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆52Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆91Updated last year
- An OCR evaluation tool☆64Updated last month
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆24Updated 2 years ago
- Detect and align similar passages☆88Updated 2 months ago
- A Python library for topic modeling and visualization☆64Updated 4 years ago
- Python tools for performing various operations on ALTO XML files☆39Updated last year
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated last year
- ☆33Updated 5 months ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 2 years ago
- ☆32Updated last year
- A suite of batches and tools for OCR tasks.☆71Updated last year
- A Mashup Interface for Text Analysis Operations☆13Updated 2 weeks ago
- Project on the history of genre.☆22Updated 4 years ago
- Workshop materials for our DH2018 workshop on word vectors. Created by Eun Seo Jo, Javier de la Rosa, and Scott Bailey☆15Updated 6 years ago
- ☆47Updated last week
- BERT and ELECTRA models trained on Europeana Newspapers☆36Updated 2 years ago