ybracke / transnormerLinks
A lexical normalizer for historical spelling variants using a transformer architecture.
☆10Updated 8 months ago
Alternatives and similar repositories for transnormer
Users that are interested in transnormer are comparing it to the libraries listed below
Sorting:
- An Easy Annotation Tool for Natural Language Processing☆11Updated last year
- SFST/SMOR/DWDS-based German Morphology☆18Updated 2 weeks ago
- Multi Tier Annotation Search☆26Updated 4 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆33Updated 3 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆507Updated last year
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆19Updated this week
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆75Updated last month
- You Actually Look Twice At it☆37Updated 10 months ago
- Python Finite-State Toolkit☆60Updated this week
- A neural dependency parser that does its best☆16Updated this week
- Norwegian Transformer Model☆115Updated 11 months ago
- 🖋 Resource and Tool for Writing System Identification -- LREC 2024☆20Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆51Updated 2 years ago
- Multi Tier Annotation Search☆12Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.☆147Updated 11 months ago
- A character-wise tokenizer for morphologically rich languages☆29Updated last month
- ☆50Updated last year
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Updated 11 months ago
- ☆11Updated 5 years ago
- UIMA CAS processing library written in Python☆90Updated last week
- Norwegian Speech Transformer Models☆18Updated last month
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆114Updated 7 years ago
- Extension for pie to include taggers with their models and pre/postprocessors☆11Updated last year
- Linguistic and stylistic complexity measures for (literary) texts☆84Updated last year
- A part-of-speech tagger with support for domain adaptation and external resources.☆23Updated 3 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated 2 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆108Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆179Updated 5 months ago
- Romanian Named Entity Corpus (RONEC) version 2.0☆65Updated 3 years ago
- Open German WordNet☆99Updated last month