proycon / analiticcl
an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction
☆36Updated 2 weeks ago
Alternatives and similar repositories for analiticcl:
Users that are interested in analiticcl are comparing it to the libraries listed below
- Pure Rust port of CRFsuite: a fast implementation of Conditional Random Fields (CRFs)☆29Updated 3 weeks ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Updated 4 years ago
- Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.☆73Updated last year
- Rust binding to crfsuite☆25Updated 3 years ago
- An Ancient Greek Morphology Tagger☆26Updated last year
- An efficient data structure for fast string similarity searches☆22Updated 4 years ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Updated last year
- Fast, permanent and flexible patterns for sharing and computing on texts with metadata using Apache Arrow.☆14Updated 3 years ago
- Succeeded by SyntaxDot: https://github.com/tensordot/syntaxdot☆25Updated 4 years ago
- Modular Rust transformer/LLM library using Candle☆36Updated 10 months ago
- Process, enhance and evaluate multiple OCR output.☆22Updated 4 months ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆17Updated this week
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆63Updated 10 months ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆15Updated 7 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated last year
- Morphological analyzer and lemmatizer for Latin.☆26Updated last month
- An OCR evaluation tool☆65Updated last month
- An efficient implementation of Partitioned Label Trees & its variations for extreme multi-label classification☆85Updated last year
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- Rust python bindings for symspell☆18Updated last year
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆20Updated 3 months ago
- QA-tool for scans with corresponding ALTO-files☆22Updated 2 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- ☆23Updated last year
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- ☆32Updated 2 years ago
- Parser for KAF NAF files written in Python☆16Updated 3 years ago
- A Demo server serving Bert through ONNX with GPU written in Rust with <3☆40Updated 3 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated last year