nlacslab / kazdet
NLA-NU Kazakh Dependency Treebank
☆10Updated 6 years ago
Alternatives and similar repositories for kazdet:
Users that are interested in kazdet are comparing it to the libraries listed below
- NLP tools for Kazakh language☆40Updated 4 years ago
- An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.☆26Updated 2 weeks ago
- NLP tools for Kazakh language☆31Updated 2 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆30Updated 5 months ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆19Updated 5 years ago
- Open Source Kazakh Corpus☆21Updated last year
- A list of initiatives for adding new languages to opensource machine translation models☆17Updated 3 months ago
- ☆13Updated 2 years ago
- ☆10Updated 3 years ago
- Accentor and transcriptor for Russian language☆122Updated 2 years ago
- Punctuation and casing restoration for the Russian Language (BERT-based)☆20Updated 3 years ago
- ☆11Updated last year
- ☆23Updated 2 months ago
- python package russtress accentuates russian text☆50Updated 4 years ago
- Train punctuation and capitalization models for different languages☆24Updated 2 years ago
- ☆23Updated 3 years ago
- the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTT…☆49Updated 3 years ago
- Breaks a word into syllables using an LSTM-based neural network.☆19Updated last year
- The grapheme to phoneme model converts Kazakh(Arab|Cyrillic) characters to phonemes.☆11Updated 5 years ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆27Updated 4 months ago
- Apertium linguistic data for Kazakh☆17Updated last year
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆118Updated 3 years ago
- 1st place solution for GramEval-2020☆14Updated 2 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆24Updated last year
- Speech analytics package for call-center☆22Updated 3 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆46Updated 5 months ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆13Updated last year
- A merged version of multiple open-source German speech datasets.☆31Updated 8 months ago
- G2P tool for Russian language with vosk-model-ru styled transcriptions☆9Updated 3 years ago
- Probing suite for evaluation of Russian embedding and language models☆32Updated 3 months ago