An approximate string matching or fuzzy-matching system for spelling correction, normalisation, or post-OCR correction (mirror of https://codeberg.org/proycon/analiticcl)
☆37Feb 10, 2026Updated 3 weeks ago
Alternatives and similar repositories for analiticcl
Users who are interested in analiticcl are comparing it to the libraries listed below
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- Python API for KB data-services☆19Jan 30, 2020Updated 6 years ago
- Simple IIIF Search service for OCRed texts☆17Dec 16, 2020Updated 5 years ago
- Fuzzy search modules for searching lists of words in low quality OCR and HTR text.☆23Updated this week
- This repository provides a script to do OCR using a basic deep learning approach☆10Aug 27, 2020Updated 5 years ago
- Single-liner codes for morphology of all words in Hebrew Bible (Biblia Hebraica Stuttgartensia Amstelodamensis, BHS A)☆12Jun 17, 2017Updated 8 years ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- A Hebrew Analytical Lexicon based on ETCBC (4c) data☆11Oct 1, 2019Updated 6 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Dec 8, 2022Updated 3 years ago
- How About Machine Learning Enhancing Theses? - a pilot discovery project☆14May 23, 2023Updated 2 years ago
- A knowledge graph system with graph neural network for drug repurposing and disease mechanism.☆18Sep 12, 2025Updated 5 months ago
- Media player developed under the Europeana Media Generic Services Project☆13Sep 7, 2023Updated 2 years ago
- An experimental Python server for scholarly web annotations☆12Sep 8, 2021Updated 4 years ago
- Bias correction for richness in abundance data☆12Aug 18, 2025Updated 6 months ago
- Parser for KAF NAF files written in Python☆16Jul 1, 2021Updated 4 years ago
- Towards a consolidated LOD vocabulary for linguistic annotations☆16Feb 14, 2026Updated 2 weeks ago
- Performs pairwise preference ranking for a given trainfile and testfile with binary class labels (1 and not 1). The binary classification…☆14Jul 12, 2017Updated 8 years ago
- Python for Linguists and Humanists☆22May 8, 2020Updated 5 years ago
- An oEmbed Service for Stanford University Libraries☆19Updated this week
- Generic Environment for Context-Aware Correction of Orthography☆22Sep 7, 2022Updated 3 years ago
- IIIF Examples and useful code☆20Sep 10, 2025Updated 5 months ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆21Aug 15, 2024Updated last year
- A workflow system for Natural Language Processing.☆21Oct 17, 2019Updated 6 years ago
- Python tools for performing various operations on ALTO XML files☆49Feb 27, 2025Updated last year
- Embed Mirador in Omeka Classic 2.3+ for building rich IIIF-compliant exhibits☆18Nov 8, 2024Updated last year
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats (TEI or FoLiA)☆23Feb 11, 2022Updated 4 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Sep 7, 2022Updated 3 years ago
- Text-Induced Corpus Clean-up☆20Jun 20, 2023Updated 2 years ago
- Catch-A - Catching Annotation: An annotation backend and API.☆20Jul 19, 2017Updated 8 years ago
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆25Feb 27, 2026Updated last week
- Project between GitHub, figshare and Mozilla Science Lab.☆67Jul 19, 2019Updated 6 years ago
- adno.app. The ADNO source code. Adno is a web application for viewing, editing and sharing narratives and pathways on static im…☆29Feb 18, 2026Updated 2 weeks ago
- Keyword extraction algorithms in Rust☆29Oct 12, 2024Updated last year
- Ubiflux Vigor ventilation system RS485 Modbus communications with Python☆11Feb 20, 2026Updated 2 weeks ago
- This is the text partitioner project for Python.☆21Dec 11, 2018Updated 7 years ago
- Pure Rust port of CRFsuite: a fast implementation of Conditional Random Fields (CRFs)☆30Updated this week
- ☆27Feb 2, 2021Updated 5 years ago
- A web client to browse and publish nanopublications.☆34Updated this week
- linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, L…☆25May 26, 2025Updated 9 months ago