SIGMORPHON 2022 Shared Task on Morpheme Segmentation
☆33 · Mar 26, 2023 · Updated 2 years ago
Alternatives and similar repositories for 2022SegmentationST
Users who are interested in 2022SegmentationST are comparing it to the repositories listed below.
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆55 · Apr 2, 2023 · Updated 2 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio… ☆36 · Apr 25, 2025 · Updated 10 months ago
- ☆11 · Apr 15, 2022 · Updated 3 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams. ☆11 · Feb 5, 2020 · Updated 6 years ago
- A python library for easily querying morphological inflection models trained on Unimorph ☆13 · Oct 23, 2022 · Updated 3 years ago
- ☆14 · May 24, 2022 · Updated 3 years ago
- Python Finite-State Toolkit ☆60 · Updated this week
- G2P tool for Russian language with vosk-model-ru styled transcriptions ☆10 · Jun 9, 2021 · Updated 4 years ago
- ☆15 · Apr 12, 2023 · Updated 2 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning ☆35 · Updated this week
- A lexicon compiler for non-suffixational morphologies ☆13 · Jan 29, 2026 · Updated last month
- ☆19 · Oct 14, 2021 · Updated 4 years ago
- A Python toolbox for text based word segmentation ☆19 · Jan 27, 2021 · Updated 5 years ago
- ☆17 · Feb 1, 2023 · Updated 3 years ago
- Second SIGMORPHON Shared Task on Grapheme-to-Phoneme Conversions ☆25 · Jun 7, 2021 · Updated 4 years ago
- New York Times Word Innovation Types dataset ☆21 · Dec 1, 2020 · Updated 5 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition) ☆23 · Nov 12, 2025 · Updated 3 months ago
- Subregular toolkit for language processing ☆23 · Jul 22, 2020 · Updated 5 years ago
- ☆28 · Sep 5, 2024 · Updated last year
- Eelbrain pipeline to analyze public Alice EEG dataset ☆31 · Nov 29, 2024 · Updated last year
- Python classes for the Buckeye Corpus ☆26 · Mar 30, 2018 · Updated 7 years ago
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning ☆31 · Apr 8, 2021 · Updated 4 years ago
- ☆11 · Mar 13, 2025 · Updated 11 months ago
- KoParadigm: Korean Inflectional Paradigm Generator ☆57 · Nov 23, 2022 · Updated 3 years ago
- Contextual Lemmatization and Morphological Tagging in 100 different languages. A Participant System for SigMorphon2019 Task 2 ☆24 · Jul 25, 2024 · Updated last year
- Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs ☆31 · Dec 15, 2014 · Updated 11 years ago
- Shami Dialect Corpus (SDC) ☆29 · Feb 13, 2018 · Updated 8 years ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms. ☆29 · May 14, 2025 · Updated 9 months ago
- Large scale unannotated Korean corpus for unsupervised tasks. (e.g. Language modeling) ☆28 · Aug 11, 2019 · Updated 6 years ago
- A Praat plug-in for performing interactive phonetic forced alignment ☆29 · Sep 22, 2018 · Updated 7 years ago
- Keyword spotting and forced alignment in any language ☆91 · Feb 12, 2026 · Updated 2 weeks ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/ ☆29 · Jul 12, 2021 · Updated 4 years ago
- Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe… ☆33 · Dec 8, 2022 · Updated 3 years ago
- Useful resources for Mongolian NLP ☆196 · Dec 14, 2024 · Updated last year
- Deno Library to upload files to GCS and obtain signed url ☆11 · Jan 16, 2024 · Updated 2 years ago
- Lecture and seminar materials for Deep Learning summer school in Ulaanbaatar, 2021 ☆10 · Jul 11, 2021 · Updated 4 years ago
- Python binding for SRI Language Modeling Toolkit implemented in Cython ☆30 · Jan 24, 2022 · Updated 4 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database. ☆10 · Mar 19, 2019 · Updated 6 years ago
- A dash app that transcribes 한글 into [hɑŋɡɯl]. ☆39 · Nov 6, 2025 · Updated 3 months ago