Fuzzy search modules for searching lists of words in low quality OCR and HTR text.
☆23Mar 2, 2026Updated 3 weeks ago
Alternatives and similar repositories for fuzzy-search
Users that are interested in fuzzy-search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An experimental Python server for scholarly web annotations☆12Sep 8, 2021Updated 4 years ago
- Loghi is a comprehensive toolkit designed for Handwritten Text Recognition (HTR) and Optical Character Recognition (OCR), offering an acc…☆140Updated this week
- 🕸 YALC: Yet Another LOD Cloud (registry of Linked Open Datasets).☆15Aug 21, 2023Updated 2 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- A curated list of awesome Citizen Science Projects in the Netherlands☆20May 4, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Dec 8, 2022Updated 3 years ago
- Image Binarization for improving OCR and HTR☆23Aug 18, 2022Updated 3 years ago
- Command line tool for linking civil registries☆14Feb 13, 2026Updated last month
- How About Machine Learning Enhancing Theses? - a pilot discovery project☆14May 23, 2023Updated 2 years ago
- Tropy plugin to import IIIF manifests☆17Mar 11, 2026Updated 2 weeks ago
- This repo contains files downloaded from Transkribus with corresponding suggested OCR improvements (performed using ChatGPT AI).☆19Mar 3, 2026Updated 3 weeks ago
- Effect Size Computation for Meta Analysis☆21Sep 25, 2023Updated 2 years ago
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction (mirror of https://…☆38Feb 10, 2026Updated last month
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Oct 27, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Wikipedia Citations in Wikidata☆10May 6, 2021Updated 4 years ago
- Python API for KB data-services☆19Jan 30, 2020Updated 6 years ago
- Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100)☆18May 29, 2022Updated 3 years ago
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)☆23Feb 11, 2022Updated 4 years ago
- Framework for autonomous vehicle risk assessment☆16May 11, 2023Updated 2 years ago
- HTRflow is the underlying engine for our HTR-pipeline☆73Mar 19, 2026Updated last week
- 🧙♂️📝 JSON-LD web editor, with autocomplete based on the loaded ontologies concepts and properties☆15Apr 22, 2023Updated 2 years ago
- Text-Induced Corpus Clean-up☆20Jun 20, 2023Updated 2 years ago
- Pytorch implementation of HTR on IAM dataset (word or line level + CTC loss)☆21Jul 28, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Typesafe IIIF presentation v3 parsing without external dependencies☆12Dec 16, 2025Updated 3 months ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆28Mar 21, 2022Updated 4 years ago
- A very fast cloud-native static spatial index for 2D points based on a Z-Order space filling curve and BIGMIN search space pruning☆12May 16, 2025Updated 10 months ago
- Program for viewing volumetric 3d image data from CT, MRI scanners and the like. I will try to experiment with many different cutting edg…☆15Oct 6, 2020Updated 5 years ago
- Train a Text Recognition CRNN model with Tensorflow2 & Keras & IAM Dataset. Convolutional Recurrent Neural Network. CTC.☆21May 7, 2020Updated 5 years ago
- RDF discovery and publication platform☆10Mar 2, 2026Updated 3 weeks ago
- Interactive, IIIF powered audio/video media player React components library. Styleguidist Docs: https://samvera-labs.github.io/ramp/☆36Updated this week
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- This repository contains source code to binarize any real-value word embeddings into binary vectors.☆48Jan 7, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆14Oct 21, 2022Updated 3 years ago
- A Test Collection for Evaluating Retrieval of Studies for Inclusion in Systematic Reviews☆12Sep 22, 2023Updated 2 years ago
- NLP pipeline software using common workflow language☆35Apr 22, 2019Updated 6 years ago
- Provide RESTful access to SKOS vocabularies☆58Apr 19, 2023Updated 2 years ago
- ☆12Jun 10, 2024Updated last year
- ☆17Jul 14, 2021Updated 4 years ago
- Yet Another Sparql GUI☆20Dec 8, 2025Updated 3 months ago