comphist / norma
A tool for automatic spelling normalization
☆20Updated 4 years ago
Alternatives and similar repositories for norma:
Users that are interested in norma are comparing it to the libraries listed below
- A tool for text normalisation via character-level machine translation☆13Updated 4 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- Compiled tools, datasets, and other resources for historical text normalization.☆17Updated 5 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆12Updated last year
- A simple configurable tool for manipulating dependency trees.☆13Updated last month
- Multi Tier Annotation Search☆26Updated 3 years ago
- A web-based, token-level annotation tool for non-standard language data☆10Updated 4 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 2 years ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated 8 months ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- The Mueller Report Corpus V 0.1☆11Updated 4 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated last week
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆74Updated 2 weeks ago
- English web corpus with 4M tokens and several annotation types☆26Updated last year
- A software to detect text reuse with BLAST.☆14Updated 5 years ago
- spaCy-to-naf converter☆21Updated 8 months ago
- Python framework for processing Universal Dependencies data☆55Updated last week
- A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st…☆24Updated last month
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Updated 5 years ago
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆10Updated 2 years ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆18Updated 8 months ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆36Updated 5 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆55Updated last month
- Repository for the Georgetown University Multilayer Corpus (GUM)☆91Updated this week
- Text Re-use Alignment Visualization☆38Updated 7 years ago
- Software for multi-level annotation of linguistic corpora☆17Updated 5 years ago
- Simple CORPORA list crawler☆10Updated 8 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated 2 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆37Updated last year