Fuzzy search modules for searching lists of words in low quality OCR and HTR text.
☆23Mar 30, 2026Updated last month
Alternatives and similar repositories for fuzzy-search
Users that are interested in fuzzy-search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆27Feb 2, 2021Updated 5 years ago
- Loghi is a comprehensive toolkit designed for Handwritten Text Recognition (HTR) and Optical Character Recognition (OCR), offering an acc…☆144Apr 8, 2026Updated last month
- Tools for TICCL☆14Dec 12, 2025Updated 5 months ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Dec 8, 2022Updated 3 years ago
- How About Machine Learning Enhancing Theses? - a pilot discovery project☆14May 23, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ARK minter, binder, resolver☆23May 8, 2026Updated 3 weeks ago
- Self hosting code for Recogito-Studio☆22Apr 13, 2026Updated last month
- ⏱ Superfast ^Advanced wildcards++? | Unique algorithms that was implemented on native unmanaged C++ but easily accessible in .NET via Con…☆28Jul 18, 2021Updated 4 years ago
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction (mirror of https://…☆40Feb 10, 2026Updated 3 months ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Sep 7, 2022Updated 3 years ago
- Performs pairwise preference ranking for a given trainfile and testfile with binary class labels (1 and not 1). The binary classification…☆14Jul 12, 2017Updated 8 years ago
- BlackLab Frontend, a feature-rich corpus search interface for BlackLab.☆24May 21, 2026Updated last week
- Label propagation algorithm for community detection based on node importance and label influence☆12Feb 15, 2018Updated 8 years ago
- Python API for KB data-services☆20Jan 30, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Bias correction for richness in abundance data☆13Apr 20, 2026Updated last month
- C implementation of the Louvain method for community detection in graphs☆11Mar 10, 2020Updated 6 years ago
- For benchmaring community detection algorithms on social networks with meta-data☆17Sep 19, 2014Updated 11 years ago
- Testing out HTR-OCR-Text translation using Google's Tesseract engine in real-time.☆20Oct 6, 2020Updated 5 years ago
- A streaming algorithm for community detection algorithm in very large networks☆15Mar 8, 2017Updated 9 years ago
- Source code for WACV20 paper "Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition".☆16Jun 12, 2020Updated 5 years ago
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)☆23Feb 11, 2022Updated 4 years ago
- CERberus -- guardian against character errors☆29Feb 15, 2024Updated 2 years ago
- HTRflow is the underlying engine for our HTR-pipeline☆75Apr 9, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Text-Induced Corpus Clean-up☆20Jun 20, 2023Updated 2 years ago
- Running massive simulations using RNNs on CPUs for building bots and all kinds of things.☆13Jun 13, 2021Updated 4 years ago
- Pytorch implementation of HTR on IAM dataset (word or line level + CTC loss)☆21Jul 28, 2022Updated 3 years ago
- Typesafe IIIF presentation v3 parsing without external dependencies☆12May 20, 2026Updated last week
- Netherlands eScience Center - Shifting Concepts Through Time project☆28Mar 21, 2022Updated 4 years ago
- Local Community Detection in Multiple Netwrks☆16Feb 27, 2025Updated last year
- A Flask decorator to output RDF using content negotiation.☆16Jul 6, 2020Updated 5 years ago
- The Rookie Text Analysis System☆10Dec 8, 2022Updated 3 years ago
- Auto configure a remote for a fork!☆12Dec 1, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- TREC Core track☆11Jul 5, 2017Updated 8 years ago
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 8 years ago
- chainer v2 implementation of instance normalization☆11Aug 8, 2018Updated 7 years ago
- Interactive, IIIF powered audio/video media player React components library. Styleguidist Docs: https://samvera-labs.github.io/ramp/☆37Updated this week
- RDF discovery and publication platform☆10May 4, 2026Updated 3 weeks ago
- A utility library intended at providing reader macros for lambdas, arrays, accessors, hash-tables and hash-sets.☆14Feb 12, 2023Updated 3 years ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago