Helsinki-NLP / UkrainianLT
A collection of links to Ukrainian language tools
☆30Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for UkrainianLT
- Dictionary of obscene words for Ukrainian language☆17Updated 3 years ago
- Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libriaries, etc.)☆166Updated last month
- Adds word stress to Ukrainian texts☆45Updated last month
- ☆23Updated 2 years ago
- Training scripts for Speech-To-Text models for Ukrainian language☆34Updated last year
- python package russtress accentuates russian text☆50Updated 4 years ago
- SpaCy official Russian model proposal☆31Updated 3 years ago
- Ukrainian instruction-tuned language models and datasets☆84Updated 4 months ago
- Dictionary of word stresses in the Ukrainian language 🇺🇦☆19Updated last month
- ☆27Updated last week
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Updated 3 years ago
- Simple python lib to tokenize texts into sentences and sentences to words. Small, fast and robust. Comes with ukrainian flavour☆60Updated last year
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆26Updated 2 months ago
- Russian data from the SynTagRus corpus.☆80Updated last week
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆53Updated 6 years ago
- Морфологический анализатор русского языка☆40Updated last year
- Comparing quality and performance of NLP systems for Russian language☆44Updated last year
- ☆26Updated last year
- ☆34Updated 7 years ago
- Russian coreference resolution competition☆10Updated last year
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Updated last year
- Home of Projector's "Data Science. Natural Language Processing" 2020 Edition☆18Updated last year
- UNLP 2024 Shared Task on LLM instruction-tuning for Ukrainian☆13Updated 7 months ago
- Extracts parallel corpora from the 2 raw texts in different languages.☆35Updated 2 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆46Updated 2 months ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆19Updated 5 years ago
- Scripts for updating pymorphy2 dictionaries☆37Updated 6 months ago
- AWD-LSTM language model trained on newspaper corpora with fast.ai☆27Updated 4 years ago
- Code and dataset for tracing semantic changes in Russian adjectives☆12Updated 4 years ago
- Ukrainian ELECTRA model☆12Updated last year