Fuzzy search modules for searching lists of words in low quality OCR and HTR text.
β23Mar 30, 2026Updated 2 months ago
Alternatives and similar repositories for fuzzy-search
Users that are interested in fuzzy-search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An experimental Python server for scholarly web annotationsβ12Sep 8, 2021Updated 4 years ago
- πΈ YALC: Yet Another LOD Cloud (registry of Linked Open Datasets).β15Aug 21, 2023Updated 2 years ago
- Use spaCy for NLP and output to the FoLiA XML format.β12Feb 27, 2024Updated 2 years ago
- Tools for TICCLβ14Dec 12, 2025Updated 6 months ago
- A curated list of awesome Citizen Science Projects in the Netherlandsβ20May 4, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia descriptionβ¦β11Dec 8, 2022Updated 3 years ago
- Unofficial implementation of the paper "MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition" by Bhunia et al. (2021).β14Jun 22, 2022Updated 3 years ago
- Image Binarization for improving OCR and HTRβ23Aug 18, 2022Updated 3 years ago
- Command line tool for linking civil registriesβ14Feb 13, 2026Updated 4 months ago
- How About Machine Learning Enhancing Theses? - a pilot discovery projectβ14May 23, 2023Updated 3 years ago
- Tropy plugin to import IIIF manifestsβ17Mar 11, 2026Updated 3 months ago
- ARK minter, binder, resolverβ23May 28, 2026Updated 3 weeks ago
- Self hosting code for Recogito-Studioβ22Apr 13, 2026Updated 2 months ago
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction (mirror of https://β¦β40Feb 10, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A set of workflows for corpus building through OCR, post-correction and normalisationβ49Sep 7, 2022Updated 3 years ago
- Performs pairwise preference ranking for a given trainfile and testfile with binary class labels (1 and not 1). The binary classificationβ¦β14Jul 12, 2017Updated 8 years ago
- Wikipedia Citations in Wikidataβ10May 6, 2021Updated 5 years ago
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parserβ¦β49Feb 2, 2026Updated 4 months ago
- Python API for KB data-servicesβ20Jan 30, 2020Updated 6 years ago
- Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100)β18May 29, 2022Updated 4 years ago
- An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic aβ¦β18Nov 18, 2024Updated last year
- Fast MRI reconstruction on CUDA GPUsβ10Dec 30, 2023Updated 2 years ago
- CERberus -- guardian against character errorsβ30Feb 15, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- π§ββοΈπ JSON-LD web editor, with autocomplete based on the loaded ontologies concepts and propertiesβ15Apr 22, 2023Updated 3 years ago
- Efficient hOCR toolingβ57Aug 18, 2025Updated 10 months ago
- Code comment watcher that notifies when an issue is closed.β10Oct 18, 2025Updated 8 months ago
- HTRflow is the underlying engine for our HTR-pipelineβ76Apr 9, 2026Updated 2 months ago
- Project between GitHub, figshare and Mozilla Science Lab.β68Jul 19, 2019Updated 6 years ago
- Running massive simulations using RNNs on CPUs for building bots and all kinds of things.β13Jun 13, 2021Updated 5 years ago
- Pytorch implementation of HTR on IAM dataset (word or line level + CTC loss)β21Jul 28, 2022Updated 3 years ago
- Typesafe IIIF presentation v3 parsing without external dependenciesβ12May 20, 2026Updated 3 weeks ago
- Netherlands eScience Center - Shifting Concepts Through Time projectβ28Mar 21, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Browser based post correction tool for Alto XML filesβ14Sep 20, 2013Updated 12 years ago
- The Rookie Text Analysis Systemβ10Dec 8, 2022Updated 3 years ago
- TREC Core trackβ11Jul 5, 2017Updated 8 years ago
- Program for viewing volumetric 3d image data from CT, MRI scanners and the like. I will try to experiment with many different cutting edgβ¦β15Oct 6, 2020Updated 5 years ago
- OWL 2 SHACL conversion ruleβ18Mar 18, 2024Updated 2 years ago
- chainer v2 implementation of instance normalizationβ11Aug 8, 2018Updated 7 years ago
- Interactive, IIIF powered audio/video media player React components library. Styleguidist Docs: https://samvera-labs.github.io/ramp/β37Jun 11, 2026Updated last week