mikahama / uralicNLP
An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. It also supports some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English. LLMs, FSTs and More!
☆76 · Updated 3 months ago
Alternatives and similar repositories for uralicNLP:
Users who are interested in uralicNLP compare it to the libraries listed below.
- The NLG tool for Finnish · ☆22 · Updated last year
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more … · ☆112 · Updated 10 months ago
- Open morphology for Finnish · ☆88 · Updated 2 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. · ☆34 · Updated last year
- A character-wise tokenizer for morphologically rich languages · ☆27 · Updated 2 weeks ago
- The amazing 🐕 will normalize non-standard Finnish/Swedish and dialectalize standard Finnish! · ☆25 · Updated 7 months ago
- Automatically exported from code.google.com/p/foma · ☆121 · Updated last month
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation · ☆25 · Updated last year
- 3000+ machine-readable open source dictionaries distributed by the Applied Computational Linguistics lab at the University of Augsburg, G… · ☆10 · Updated last year
- Python Finite-State Toolkit · ☆53 · Updated 3 weeks ago
- Yet another search platform for linguistic corpora. · ☆22 · Updated last week
- Sentence aligner · ☆112 · Updated 3 years ago
- Python API to access glottolog/glottolog · ☆29 · Updated 4 months ago
- These are lists for a variety of languages containing words that are distinctive to each language. · ☆37 · Updated 2 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages. · ☆31 · Updated last week
- Python framework for processing Universal Dependencies data · ☆55 · Updated this week
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning · ☆33 · Updated 2 weeks ago
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies · ☆15 · Updated 5 years ago
- Benchmark Arabic text diacritization dataset · ☆74 · Updated 5 years ago
- Helsinki Finite-State Technology (library and application suite) · ☆128 · Updated last month
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies) · ☆27 · Updated 3 years ago
- ☆64 · Updated 10 months ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars · ☆17 · Updated 9 months ago
- SIGTYP 2022 Shared Task · ☆9 · Updated 2 years ago
- Morphological Dictionaries for German Language · ☆28 · Updated 6 years ago
- German Morphological Analyzer · ☆47 · Updated 3 years ago
- Arabic Dialect Identification on AOC data. · ☆24 · Updated 6 years ago
- now you can even use apertium from python · ☆31 · Updated last year
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language · ☆15 · Updated this week
- Jupyter notebooks for course "Computational Morphology with HFST". · ☆18 · Updated 2 years ago