☆36Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for SentEvalRu
Users who are interested in SentEvalRu compare it to the libraries listed below
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆52Jul 4, 2018Updated 7 years ago
- Natural language processing tools for English and Russian (POS tagging, syntax parsing, SRL, NER, language detection, etc.)☆64Feb 5, 2026Updated 3 weeks ago
- Differentiable lower bound for BLEU score.☆12Apr 13, 2019Updated 6 years ago
- RuNNE☆12Jul 17, 2024Updated last year
- Datasets for evaluation of keyword extraction in Russian☆31Sep 23, 2020Updated 5 years ago
- Corpus of Russian news articles collected from Lenta.Ru☆146Nov 19, 2022Updated 3 years ago
- A course on deep learning in natural language processing for master's students in computational linguistics at the Higher School of Economics☆49Sep 5, 2022Updated 3 years ago
- Accentor and transcriptor for Russian language☆133Jun 19, 2022Updated 3 years ago
- Named entity recognizer based on ELMo or BERT as feature extractor and CRF as final classifier☆80Mar 24, 2023Updated 2 years ago
- ☆33Sep 20, 2017Updated 8 years ago
- Dataset collected from popular Russian collective blog Habrahabr.ru☆13Oct 24, 2016Updated 9 years ago
- Corpus of Russian profanity for NLP needs. Any corrections and additions are welcome☆140Jan 15, 2020Updated 6 years ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆157May 22, 2024Updated last year
- Telegram bot for classifying bird species by image☆16Jan 9, 2022Updated 4 years ago
- ☆39Nov 16, 2017Updated 8 years ago
- 🔬 Cleaning datasets of junk (normalization, preprocessing)☆41Mar 18, 2021Updated 4 years ago
- Open Source framework for developing Dialog Agents☆20Mar 26, 2018Updated 7 years ago
- A Russian data set for question answering over Wikidata☆49Jun 6, 2021Updated 4 years ago
- NLP workshop at DataFest Siberia 2019☆22Dec 8, 2022Updated 3 years ago
- Project for converting numbers written in text form in Russian.☆105May 13, 2021Updated 4 years ago
- ☆29Sep 25, 2025Updated 5 months ago
- RDT: Russian Distributional Thesaurus (Русский Дистрибутивный Тезаурус)☆30Feb 28, 2019Updated 7 years ago
- Mini-library for producing graph visualizations from embedding models☆28Sep 10, 2020Updated 5 years ago
- Model for predicting categories of entities by their mentions☆31Jun 23, 2021Updated 4 years ago
- NTI IRS 2019-2020 "y combinator" team repository☆10Mar 21, 2020Updated 5 years ago
- 10 tasks with 3 exercises each based on PyTorch☆106Aug 12, 2020Updated 5 years ago
- Web-ify your word2vec: framework to serve distributional semantic models online☆204Feb 20, 2025Updated last year
- My NLP datasets for Russian language☆386Feb 18, 2023Updated 3 years ago
- A Python wrapper for the RuWordNet thesaurus☆72Nov 27, 2024Updated last year
- Russian names parsers, gender identification and processing tools☆137Dec 6, 2023Updated 2 years ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Sep 23, 2021Updated 4 years ago
- Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook☆28Mar 16, 2023Updated 2 years ago
- ☆33Feb 14, 2019Updated 7 years ago
- Open STT☆818Mar 11, 2022Updated 3 years ago
- A smart chunker for efficiently preparing long documents for RAG☆13Jan 21, 2026Updated last month
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- Russian Law as Open Data☆48Feb 5, 2026Updated 3 weeks ago
- Python port of arc90's readability bookmarklet, updated to match latest readability.js!☆19Sep 13, 2011Updated 14 years ago
- Crawlers for the Taiga Corpus and Taiga Parser project, downloading resources from open sources☆14Apr 9, 2019Updated 6 years ago