ybracke / transnormerLinks
A lexical normalizer for historical spelling variants using a transformer architecture.
☆10Updated 10 months ago
Alternatives and similar repositories for transnormer
Users that are interested in transnormer are comparing it to the libraries listed below
Sorting:
- An Easy Annotation Tool for Natural Language Processing☆11Updated last year
- You Actually Look Twice At it☆38Updated last year
- SFST/SMOR/DWDS-based German Morphology☆20Updated last week
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆19Updated last week
- A neural dependency parser that does its best☆16Updated last month
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆515Updated last year
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Updated 2 years ago
- Multi Tier Annotation Search☆26Updated 4 years ago
- Python Finite-State Toolkit☆60Updated last month
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆76Updated 2 weeks ago
- Latin BERT☆70Updated last year
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆56Updated 2 years ago
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Updated last year
- Multi Tier Annotation Search☆12Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆184Updated 8 months ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆34Updated 3 years ago
- A software to detect text reuse with BLAST.☆13Updated 6 years ago
- A character-wise tokenizer for morphologically rich languages☆31Updated 4 months ago
- ☆11Updated 5 years ago
- Extension for pie to include taggers with their models and pre/postprocessors☆11Updated last year
- Norwegian Transformer Model☆116Updated last month
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho…☆22Updated 3 years ago
- Detect and align similar passages☆117Updated 4 months ago
- An OCR evaluation tool☆68Updated 5 months ago
- ☆50Updated last year
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆14Updated 2 months ago
- Master repository which includes most other OCR-D repositories as submodules☆72Updated 7 months ago
- A small python library to parse and write TSV files generated by the WebAnno software.☆12Updated 9 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 5 years ago
- CERberus -- guardian against character errors☆29Updated last year