chapayevdauren / kazakh-language-corpus
Open Source Kazakh Corpus
☆21Updated last year
Alternatives and similar repositories for kazakh-language-corpus:
Users who are interested in kazakh-language-corpus are comparing it to the libraries listed below.
- NLP tools for Kazakh language☆31Updated 2 years ago
- ☆22Updated 3 years ago
- NLP tools for Kazakh language☆40Updated 4 years ago
- An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.☆26Updated 2 weeks ago
- Probing suite for evaluation of Russian embedding and language models☆32Updated 3 months ago
- Apertium linguistic data for Kazakh☆17Updated last year
- NLA-NU Kazakh Dependency Treebank☆10Updated 6 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress), a Python package for placing stress in Russian text using RNN (BiLST…☆30Updated 5 months ago
- Python package russtress, which accentuates Russian text☆50Updated 4 years ago
- 🔬 Cleaning datasets of junk (normalization, preprocessing)☆39Updated 3 years ago
- Accentor and transcriptor for Russian language☆122Updated 2 years ago
- Speech analytics package for call-center☆22Updated 3 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆46Updated 5 months ago
- ☆79Updated 2 years ago
- A project for converting numbers written out as text in Russian.☆102Updated 3 years ago
- 1st place solution for GramEval-2020☆14Updated 2 years ago
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆118Updated 3 years ago
- A list of initiatives for adding new languages to opensource machine translation models☆17Updated 3 months ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆19Updated 5 years ago
- "Rossiya Segodnya" news dataset☆45Updated 5 years ago
- AWD-LSTM language model trained on newspaper corpora with fast.ai☆27Updated 4 years ago
- A library for extracting statistics from Russian-language texts.☆105Updated 2 years ago
- Poetic corpus of the Russian language☆41Updated 7 years ago
- Large silver-standard Russian corpus with NER, morphology, and syntax markup☆63Updated last year
- Morphological Analyzer for Russian 💬☆40Updated 3 years ago
- Russian SuperGLUE benchmark☆109Updated last year
- A Python wrapper for the RuWordNet thesaurus☆59Updated 2 months ago
- A neural network for restoring punctuation in Russian text.☆20Updated 2 years ago
- A course on deep learning in natural language processing for master's students in computational linguistics at the Higher School of Economics☆47Updated 2 years ago
- Russian language support for NLTK's PunktSentenceTokenizer☆53Updated 5 years ago