chapayevdauren / kazakh-language-corpus
Open Source Kazakh Corpus
☆21Updated last year
Related projects: ⓘ
- Apertium linguistic data for Kazakh☆17Updated 10 months ago
- NLP tools for Kazakh language☆31Updated 2 years ago
- python package russtress accentuates russian text☆50Updated 4 years ago
- NLP tools for Kazakh language☆39Updated 3 years ago
- Accentor and transcriptor for Russian language☆118Updated 2 years ago
- the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTT…☆45Updated 3 years ago
- An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.☆25Updated last year
- Punctuation and casing restoration for the Russian Language (BERT-based)☆19Updated 2 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆29Updated last month
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆101Updated 3 years ago
- Нейронная сеть для восстановления пунктуации на русском языке.☆20Updated 2 years ago
- ☆29Updated last year
- ☆23Updated 2 years ago
- ☆77Updated last year
- Morphological Parser for Russian is able to split words into morphemes: prefixes, roots, infixes and postfixes☆12Updated 4 years ago
- Comparing quality and performance of NLP systems for Russian language☆44Updated last year
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆116Updated 3 years ago
- 🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺☆56Updated 3 years ago
- Probing suite for evaluation of Russian embedding and language models☆32Updated 2 years ago
- Python клиент API распознавания и синтеза речи Облака ЦРТ☆11Updated last year
- AWD-LSTM language model trained on newspaper corpora with fast.ai☆27Updated 4 years ago
- ☆44Updated this week
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆30Updated 2 years ago
- Поэтический корпус русского языка☆41Updated 6 years ago
- Speech analytics package for call-center☆22Updated 3 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆59Updated last year
- nlp workshop at datafest siberia 2019☆22Updated last year
- Extracts parallel corpora from the 2 raw texts in different languages.☆34Updated last year
- 🔬 Очистка датасетов от мусора (нормализация, препроцессинг)☆40Updated 3 years ago
- Russian RoBERTa☆29Updated 4 years ago