averkij / a-studio
Lingtrain Alignment Studio is an ML based app for texts alignment on different languages. It can produce parallel corpora and parallel books.
☆233Updated last month
Related projects: ⓘ
- Lingtrain Aligner — ML powered library for the accurate texts alignment.☆118Updated last month
- Deep Learning based NLP modeling for Russian language☆222Updated last year
- Открытые лингвистические датасеты: тональный словарь русского языка КартаСловСент, датасет по семантике, ассоциативный граф и датасет по …☆358Updated 2 years ago
- ☆121Updated 3 years ago
- Links to Russian corpora + Python functions for loading and parsing☆280Updated last year
- Rule-based token, sentence segmentation for Russian language☆248Updated last year
- Compact high quality word embeddings for Russian language☆180Updated last year
- Проект для распознавания речи на русском языке на основе pykaldi.☆321Updated last month
- Sentiment analysis library for russian language☆311Updated 10 months ago
- Opendata resources in Russian / Открытые данные на русском языке☆201Updated 2 years ago
- Russian language models for spaCy☆242Updated 3 years ago
- My NLP datasets for Russian language☆347Updated last year
- Rule-based facts extraction for Russian language☆312Updated last year
- Russian names parsers, gender identification and processing tools☆128Updated 9 months ago
- ☆46Updated last year
- Automatic news aggregator in Telegram / Автоматический агрегатор новостей в Телеграме☆184Updated 4 months ago
- ☆109Updated 5 years ago
- Библиотеки и ресурсы для Я ндекс.Диалогов☆293Updated 6 months ago
- Корпус ненормативной лексики русского языка для нужд NLP. Любые исправления и дополнения приветствуются☆136Updated 4 years ago
- Seman is a set of linguistic tools to analyze Russian or German texts, it contains lexicons and grammars. The project is interesting as a…☆83Updated 2 months ago
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.☆90Updated last year
- A web-based engine for creating and annotating textual corpora☆241Updated last year
- Corpus of Russian news articles collected from Lenta.Ru☆140Updated last year
- ☆495Updated 3 years ago
- Подборка ресурсов по машинному обучению☆53Updated 8 years ago
- Библиотека для анализа и генерации стихов на русском языке☆177Updated 7 months ago
- Python SDK for Yandex Speechkit API.☆44Updated 6 months ago
- Russian speech technology links☆188Updated 2 weeks ago
- ☆205Updated 3 years ago
- Russian SuperGLUE benchmark☆106Updated last year