ai-forever / sage
SAGE: Spelling correction, corruption and evaluation for multiple languages
☆151Updated 3 months ago
Alternatives and similar repositories for sage:
Users that are interested in sage are comparing it to the libraries listed below
- Augmentex — a library for augmenting texts with errors☆62Updated 8 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated 5 months ago
- The tiniest sentence encoder for Russian language☆213Updated 7 months ago
- Russian Corpus of Linguistic Acceptability☆42Updated 5 months ago
- "Руформеры" - список популярных базовых моделей на основе трансформеров для решения задач по автоматической обработке русского языка☆36Updated last year
- Effective LLM Alignment Toolkit☆123Updated last week
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.☆94Updated last year
- A Python wrapper for the RuWordNet thesaurus☆60Updated 3 months ago
- Библиотека для извлечения статистик из текстов на русском языке.☆117Updated 2 years ago
- ☆57Updated last year
- Deep Learning for Speech☆89Updated 2 months ago
- Question answering on russian with XLMRobertaLarge as a service☆21Updated 3 years ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆61Updated last year
- BSNLP 2021☆33Updated 4 months ago
- Deep Learning based NLP modeling for Russian language☆230Updated last year
- Large silver standart Russian corpus with NER, morphology and syntax markup☆64Updated last year
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆103Updated 3 years ago
- Russian SuperGLUE benchmark☆109Updated last year
- Rule-based token, sentence segmentation for Russian language☆261Updated last year
- ☆121Updated last year
- Compact high quality word embeddings for Russian language☆196Updated last year
- A list of pretrained Transformer models for the Russian language.☆174Updated 5 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆39Updated 3 months ago
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆35Updated 3 years ago
- Материалы курса по компьютерной лингвистике Школы Лингвистики НИУ ВШЭ☆183Updated last week
- ☆81Updated last year
- Foundational Model for Speech Recognition Tasks☆185Updated 2 weeks ago
- Автоматическая обработка естественного языка для студентов 3-4 курсов Школы лингвистики НИУ ВШЭ.☆14Updated last year
- Links to Russian corpora + Python functions for loading and parsing☆293Updated last year