☆87Oct 19, 2022Updated 3 years ago
Alternatives and similar repositories for taiga_site
Users who are interested in taiga_site are comparing it to the repositories listed below
- Crawlers for the Taiga Corpus and Taiga Parser project; they download resources from open sources☆14Apr 9, 2019Updated 6 years ago
- http://www.dialog-21.ru/evaluation/2016/letter/☆57Dec 8, 2016Updated 9 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc.☆56May 27, 2023Updated 2 years ago
- RuREBus shared task repo☆29Jan 18, 2021Updated 5 years ago
- Russian data from the SynTagRus corpus.☆86Nov 12, 2025Updated 3 months ago
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆157May 22, 2024Updated last year
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating SOTA mode…☆39Updated this week
- ☆21Jul 28, 2020Updated 5 years ago
- ☆56May 12, 2018Updated 7 years ago
- Russian language models for spaCy☆241Jul 14, 2021Updated 4 years ago
- Question answering in Russian with XLMRobertaLarge as a service☆21Nov 6, 2021Updated 4 years ago
- Training GPT-2 on a Russian language corpus☆87Jan 18, 2021Updated 5 years ago
- Rule-based token and sentence segmentation for the Russian language☆279Jul 24, 2023Updated 2 years ago
- Differentiable lower bound for BLEU score.☆12Apr 13, 2019Updated 6 years ago
- A Parallel Russian-Simple Russian Dataset☆15Mar 30, 2023Updated 2 years ago
- [experiment] CRF-based disambiguation engine for pymorphy2☆10May 9, 2016Updated 9 years ago
- Corpus of Russian news articles collected from Lenta.Ru☆146Nov 19, 2022Updated 3 years ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆29May 14, 2025Updated 9 months ago
- System for automatic pronominal resolution for Russian☆14Apr 3, 2020Updated 5 years ago
- Mechanical Tsar (Project Halted, Maintainer Needed).☆11Feb 2, 2022Updated 4 years ago
- Word Sense Induction with neural Bi-language Models and symmetric patterns☆12Aug 31, 2018Updated 7 years ago
- MMLU eval for RU/EN☆15Jul 31, 2023Updated 2 years ago
- My NLP datasets for Russian language☆386Feb 18, 2023Updated 3 years ago
- Russian FrameBank offline resources☆13Mar 27, 2020Updated 5 years ago
- A project for converting numbers written out as text in Russian.☆105May 13, 2021Updated 4 years ago
- SpaCy official Russian model proposal☆32Jan 24, 2021Updated 5 years ago
- Open linguistic datasets: the Russian sentiment lexicon КартаСловСент, a semantics dataset, an association graph, and a dataset on …☆371Nov 24, 2021Updated 4 years ago
- ☆19Jun 21, 2020Updated 5 years ago
- Natural language processing for 3rd- and 4th-year students of the School of Linguistics at HSE University (НИУ ВШЭ).☆13Dec 20, 2023Updated 2 years ago
- Accentor and transcriptor for Russian language☆133Jun 19, 2022Updated 3 years ago
- ☆51Nov 20, 2017Updated 8 years ago
- Train punctuation and capitalization models for different languages☆26Apr 2, 2022Updated 3 years ago
- ☆25Jan 17, 2026Updated last month
- Python wrapper for PullEnti☆21Jul 31, 2020Updated 5 years ago
- Python 3 library for reading and writing warc files☆21Jan 29, 2018Updated 8 years ago
- A fork of the official TPU models repo with fixes and a solution of the Kaggle Open Images 2019 Object Detection Challenge☆49Oct 15, 2019Updated 6 years ago
- Multilingual Generative Pretrained Model☆207May 13, 2024Updated last year