unlp-workshop / unlp-2024-shared-task
UNLP 2024 Shared Task on LLM instruction-tuning for Ukrainian
☆13Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for unlp-2024-shared-task
- A collection of links to Ukrainian language tools☆30Updated 2 years ago
- Dictionary of obscene words for Ukrainian language☆17Updated 3 years ago
- Morphological Parser for Russian is able to split words into morphemes: prefixes, roots, infixes and postfixes☆14Updated 4 years ago
- ☆43Updated 3 months ago
- Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libriaries, etc.)☆165Updated 3 weeks ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆70Updated this week
- Ukranian NER annotation project☆90Updated 7 months ago
- ☆26Updated last year
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆27Updated last year
- Adds word stress to Ukrainian texts☆45Updated last month
- SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages☆7Updated 9 months ago
- розмічений руками морфо’, синт’, кореф’ корпус української мови☆26Updated 2 years ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Updated 3 years ago
- Code and dataset for tracing semantic changes in Russian adjectives☆12Updated 4 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆24Updated last year
- Norwegian Speech Transformer Models☆17Updated 3 weeks ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆48Updated 2 months ago
- Здесь собирается каталог ссылок на полезные языковые ресурсы башкирского языка☆13Updated 3 months ago
- ☆23Updated last week
- Custom Russian tokenizer for spaCy☆42Updated 5 years ago
- Data for the HIPE 2022 shared task.☆15Updated 11 months ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15Updated 3 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆66Updated last year
- ☆23Updated 2 years ago
- Wiktionary parser tool for many language editions.☆53Updated 2 years ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆73Updated 2 years ago
- Ukrainian instruction-tuned language models and datasets☆84Updated 3 months ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆43Updated last year
- Dictionary of word stresses in the Ukrainian language 🇺🇦☆18Updated last month
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆29Updated 2 years ago