averkij / lingtrain-aligner
Lingtrain Aligner — ML powered library for the accurate texts alignment.
☆127Updated last week
Alternatives and similar repositories for lingtrain-aligner:
Users that are interested in lingtrain-aligner are comparing it to the libraries listed below
- Lingtrain Alignment Studio is an ML based app for texts alignment on different languages. It can produce parallel corpora and parallel bo…☆254Updated last week
- python package russtress accentuates russian text☆51Updated 4 years ago
- Deep Learning based NLP modeling for Russian language☆228Updated last year
- Seman is a set of linguistic tools to analyze Russian or German texts, it contains lexicons and grammars. The project is interesting as a…☆84Updated 7 months ago
- Russian language models for spaCy☆242Updated 3 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆64Updated last year
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆152Updated 9 months ago
- A Python wrapper for the RuWordNet thesaurus☆59Updated 2 months ago
- Rule-based token, sentence segmentation for Russian language☆257Updated last year
- Russian SuperGLUE benchmark☆109Updated last year
- RuLeanALBERT is a pretrained masked language model for the Russian language that uses a memory-efficient architecture.☆93Updated last year
- Extracts parallel corpora from the 2 raw texts in different languages.☆35Updated 2 years ago
- Accentor and transcriptor for Russian language☆122Updated 2 years ago
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆118Updated 3 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆103Updated 3 years ago
- Compact high quality word embeddings for Russian language☆192Updated last year
- ☆57Updated last year
- Лемматизатор для русскоязычных текстов☆44Updated 4 years ago
- Comparing quality and performance of NLP systems for Russian language☆46Updated last year
- Russian Corpus of Linguistic Acceptability☆42Updated 4 months ago
- Библиотека для извлечения статистик из текстов на русском языке.☆116Updated 2 years ago
- Открытые лингвистические датасеты: тональный словарь русского языка КартаСловСент, датасет по семантике, ассоциативный граф и датасет по …☆364Updated 3 years ago
- Russian names parsers, gender identification and processing tools☆129Updated last year
- ☆83Updated 2 years ago
- Корпус ненормативной лексики русского языка для нужд NLP. Любые исправления и дополнения приветствуются☆136Updated 5 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Updated 4 months ago
- Russian language support for NLTK's PunktSentenceTokenizer☆53Updated 5 years ago
- ☆29Updated 6 years ago
- "Руформеры" - список популярных базовых моделей на основе трансформеров для решения задач по автоматической обработке русского языка☆36Updated last year
- Fine-tuned Multilingual BERT and Multilingual USE for sentiment analysis in Russian. RuReviews, RuSentiment, Kaggle Russian News Dataset,…☆52Updated 4 years ago