mikahama / murre
The amazing πwill normalize non-standard Finnish/Swedish and dialectalize standard Finnish!
β25Updated 3 months ago
Related projects β
Alternatives and complementary repositories for murre
- Tools for assessing Finnish poetry: rhymes, meter, hyphenation of Finnish and so on.β11Updated 10 months ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanβ¦β70Updated this week
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be β¦β15Updated 8 months ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.β22Updated last year
- Script for workflow to add morphological analysis into ELAN filesβ13Updated 4 years ago
- Featurize words into orthographic and phonological vectors.β40Updated last year
- ParlaMint: Comparable Parliamentary Corporaβ45Updated 3 weeks ago
- Python 3 library for processing historical Englishβ64Updated 3 months ago
- eXtensible Interlinear Glossed Textβ31Updated 2 years ago
- Umbrella repository that describes the collections contained in any given release of ELTeCβ12Updated 2 years ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.β124Updated 3 years ago
- You Actually Look Twice At itβ29Updated last month
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissionsβ19Updated last year
- Named entity annotation toolβ27Updated last year
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Spβ¦β29Updated 2 years ago
- A simple toolkit for conducting analyses using corpus methodsβ24Updated 2 years ago
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Hoβ¦β18Updated 2 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis Systemβ39Updated last year
- The NLG tool for Finnishβ22Updated 10 months ago
- A Python library for topic modeling and visualizationβ64Updated 4 years ago
- A tool for automatic spelling normalizationβ20Updated 3 years ago
- Open morphology for Finnishβ84Updated last month
- Python version for Doug Biber's Multidimensional Analysis (MDA)β27Updated 4 months ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpusβ17Updated 5 months ago
- Netherlands eScience Center - Shifting Concepts Through Time projectβ26Updated 2 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognatesβ43Updated last year
- R package for stylometric analysesβ173Updated 2 months ago
- A part-of-speech tagger with support for domain adaptation and external resources.β22Updated 2 years ago
- A web-based, token-level annotation tool for non-standard language dataβ10Updated 4 years ago
- The curation repository for the data behind Concepticon.β32Updated 2 weeks ago