mikahama / uralicNLPView external linksLinks
An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English. LLMs, FSTs and More!
☆90Nov 3, 2025Updated 3 months ago
Alternatives and similar repositories for uralicNLP
Users that are interested in uralicNLP are comparing it to the libraries listed below
Sorting:
- Tools for assessing Finnish poetry: rhymes, meter, hyphenation of Finnish and so on.☆13Dec 13, 2023Updated 2 years ago
- The NLG tool for Finnish☆24Dec 13, 2023Updated 2 years ago
- The amazing 🐕will normalize non-standard Finnish/Swedish and dialectalize standard Finnish!☆29Aug 10, 2024Updated last year
- Open morphology for Finnish☆95Jan 22, 2026Updated 3 weeks ago
- HFST optimized-lookup standalone library and command line tool☆13Feb 27, 2018Updated 7 years ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer☆21Jan 17, 2020Updated 6 years ago
- Script for workflow to add morphological analysis into ELAN files☆14May 15, 2020Updated 5 years ago
- A lexicon compiler for non-suffixational morphologies☆13Jan 29, 2026Updated 2 weeks ago
- Python 3 library for processing historical English☆68Aug 10, 2024Updated last year
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆29May 14, 2025Updated 9 months ago
- Finnish data☆11Nov 12, 2025Updated 3 months ago
- Tools for handling GRNTI list☆10Sep 2, 2023Updated 2 years ago
- Linguistic processing for Common Voice☆58Jan 18, 2024Updated 2 years ago
- HFST spell checker library and command line tool☆14Feb 20, 2024Updated last year
- Sources of Collatinus software - Latin lemmatizer and morphological analyzer☆11Apr 25, 2016Updated 9 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Nov 5, 2020Updated 5 years ago
- ☆11Nov 14, 2021Updated 4 years ago
- several algorithms for converting dependency structures into constituency structures.☆10Feb 7, 2022Updated 4 years ago
- ☆25Apr 28, 2020Updated 5 years ago
- Russian/English/Estonian/Finnish/Swedish phonetic algorithm based on Soundex and Metaphone☆52Mar 1, 2025Updated 11 months ago
- BabelNet (and WordNet) sense embedding trained with Word2Vec and FastText☆10Sep 3, 2019Updated 6 years ago
- Contextual Lemmatization and Morphological Tagging in 100 different languages. A Participant System for SigMorphon2019 Task 2☆24Jul 25, 2024Updated last year
- ☆51Nov 20, 2017Updated 8 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆16Feb 4, 2026Updated last week
- WordWanderer – take your text for a walk☆12May 14, 2019Updated 6 years ago
- Central Alaskan Yup'ik FST morphological analyzer/generator☆13Feb 4, 2026Updated last week
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Mar 27, 2023Updated 2 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Byte-level byte pair encoding (BPE) in Haskell☆17May 27, 2024Updated last year
- ☆16Jan 20, 2022Updated 4 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Apr 5, 2019Updated 6 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 5 years ago
- Overview of Icelandic NLP resources at a glance☆18Jun 20, 2024Updated last year
- Retter aktive nettside fra nynorsk til norsk (bokmål), for økt leseglede.☆21Sep 16, 2025Updated 4 months ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Faroese language☆18Updated this week
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago
- Tools for the 3rd edition of the Constraint Grammar formalism.☆23Nov 17, 2025Updated 2 months ago
- Fast corpus search engine originally made for the Corpus of Written Tatar language☆17Nov 9, 2019Updated 6 years ago
- Unitex/GramLab Language Resources☆18Aug 11, 2022Updated 3 years ago