makazhan / kaznlp
NLP tools for Kazakh language
☆31Updated 2 years ago
Related projects:
- NLP tools for Kazakh language☆39Updated 3 years ago
- An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.☆25Updated last year
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆15Updated 5 years ago
- Probing suite for evaluation of Russian embedding and language models☆32Updated 2 years ago
- "Rossiya Segodnya" news dataset☆45Updated 4 years ago
- A course on deep learning in natural language processing for computational linguistics master's students at the Higher School of Economics☆47Updated 2 years ago
- Russian RoBERTa☆29Updated 4 years ago
- Russian coreference resolution made as simple and accessible as could be☆12Updated 2 years ago
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc.☆52Updated last year
- RuSimpleSentEval (RSSE) shared task repo☆21Updated 3 years ago
- http://www.dialog-21.ru/evaluation/2016/letter/☆56Updated 7 years ago
- Morphological Parser for Russian is able to split words into morphemes: prefixes, roots, infixes and postfixes☆12Updated 4 years ago
- A Russian data set for question answering over Wikidata☆46Updated 3 years ago
- Russian Corpus of Linguistic Acceptability☆40Updated last year
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆45Updated 3 weeks ago
- Python package russtress accentuates Russian text☆50Updated 4 years ago
- Open Source Kazakh Corpus☆21Updated last year
- Materials for the course "Computational Linguistics and Information Technologies" for 4th-year bachelor's students in the "Fundamental and Appl…☆9Updated 3 years ago
- NLP course @ CS Faculty, HSE☆15Updated 4 years ago
- A project for converting numbers written out in words in Russian.☆101Updated 3 years ago
- Large silver standard Russian corpus with NER, morphology and syntax markup☆59Updated last year
- BSNLP 2021☆32Updated 2 years ago
- Pipeline for easy fine-tuning of BERT architecture for sequence classification☆22Updated last year
- AWD-LSTM language model trained on newspaper corpora with fast.ai☆27Updated 4 years ago
- Apertium linguistic data for Kazakh☆17Updated 10 months ago
- Custom Russian tokenizer for spaCy☆42Updated 5 years ago
- Russian FrameBank offline resources☆13Updated 4 years ago