nlacslab / kaznlp
NLP tools for Kazakh language
☆40Updated 4 years ago
Alternatives and similar repositories for kaznlp:
Users who are interested in kaznlp are comparing it to the libraries listed below
- NLP tools for Kazakh language☆31Updated 2 years ago
- An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.☆26Updated 2 weeks ago
- NLA-NU Kazakh Dependency Treebank☆10Updated 6 years ago
- Probing suite for evaluation of Russian embedding and language models☆32Updated 3 months ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆46Updated 5 months ago
- Apertium linguistic data for Kazakh☆17Updated last year
- Large silver-standard Russian corpus with NER, morphology and syntax markup☆63Updated last year
- A Russian data set for question answering over Wikidata☆47Updated 3 years ago
- A library for extracting statistics from Russian-language texts.☆105Updated 2 years ago
- A course on deep learning in natural language processing for computational linguistics master's students at the Higher School of Economics☆47Updated 2 years ago
- Gazeta: Dataset for automatic summarization of Russian news☆34Updated 3 years ago
- A Python wrapper for the RuWordNet thesaurus☆59Updated 2 months ago
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc.☆53Updated last year
- 🔬 Cleaning junk out of datasets (normalization, preprocessing)☆39Updated 3 years ago
- Open Source Kazakh Corpus☆21Updated last year
- Russian SuperGLUE benchmark☆109Updated last year
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆118Updated 3 years ago
- A list of pretrained Transformer models for the Russian language.☆173Updated 4 years ago
- Russian language models for spaCy☆242Updated 3 years ago
- A project for converting numbers written out in words in Russian.☆102Updated 3 years ago
- Compact high quality word embeddings for Russian language☆190Updated last year
- Deep Learning based NLP modeling for Russian language☆227Updated last year
- RuREBus shared task repo☆30Updated 4 years ago
- G2P tool for Russian language with vosk-model-ru styled transcriptions☆9Updated 3 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆22Updated last year
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆19Updated 5 years ago
- Russian language support for NLTK's PunktSentenceTokenizer☆53Updated 5 years ago
- Links to Russian corpora + Python functions for loading and parsing☆289Updated last year
- ☆23Updated 2 months ago
- https://arxiv.org/abs/2201.06499☆28Updated 9 months ago