unlp-workshop / unlp-2024-shared-task
UNLP 2024 Shared Task on LLM instruction-tuning for Ukrainian
☆17Updated last year
Alternatives and similar repositories for unlp-2024-shared-task:
Users that are interested in unlp-2024-shared-task are comparing it to the libraries listed below
- UNLP 2025 Shared Task on Detecting Social Media Manipulation☆19Updated 2 weeks ago
- Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libriaries, etc.)☆195Updated last week
- Dictionary of obscene words for Ukrainian language☆18Updated 3 years ago
- Ukranian NER annotation project☆90Updated last week
- Simple python lib to tokenize texts into sentences and sentences to words. Small, fast and robust. Comes with ukrainian flavour☆61Updated last year
- Ukrainian instruction-tuned language models and datasets☆95Updated 9 months ago
- A corpus of Ukrainian Twitter texts + instructions for downloading and filtering texts.☆15Updated 5 years ago
- Home of Projector's "Data Science. Natural Language Processing" 2020 Edition☆19Updated last year
- A collection of links to Ukrainian language tools☆35Updated 2 years ago
- UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language☆260Updated last year
- Dictionary of word stresses in the Ukrainian language 🇺🇦☆20Updated 6 months ago
- ☆27Updated 5 months ago
- Morphological Parser for Russian is able to split words into morphemes: prefixes, roots, infixes and postfixes☆15Updated 4 years ago
- розмічений руками морфо’, синт’, кореф’ корпус української мови☆27Updated 2 years ago
- SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages☆8Updated last year
- A collection of datasets for Ukrainian language☆58Updated 9 months ago
- Adds word stress to Ukrainian texts☆51Updated 6 months ago
- Браунський корпус української мови☆113Updated this week
- Code and dataset for tracing semantic changes in Russian adjectives☆12Updated 5 years ago
- Russian data from the SynTagRus corpus.☆81Updated 5 months ago
- Extracts parallel corpora from the 2 raw texts in different languages.☆36Updated 2 years ago
- Probing suite for evaluation of Russian embedding and language models☆33Updated 6 months ago
- ☆20Updated 8 years ago
- the list of ~2000 ukrainian stopwords (with numbers)☆61Updated 3 years ago
- ☆47Updated 9 months ago
- Материалы курса "Компьютерная лингвистика и информационные технологии" для 4-го курса бакалавриата направления "Фундаментальная и приклад…☆9Updated 4 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆29Updated 3 years ago
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events☆29Updated last year
- ☆25Updated last month
- Simple library to work with pre-trained ELMo models in TensorFlow☆52Updated last year