unlp-workshop / unlp-2024-shared-taskLinks
UNLP 2024 Shared Task on LLM instruction-tuning for Ukrainian
☆17Updated last year
Alternatives and similar repositories for unlp-2024-shared-task
Users that are interested in unlp-2024-shared-task are comparing it to the libraries listed below
Sorting:
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆83Updated 4 months ago
- Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libriaries, etc.)☆215Updated 3 weeks ago
- ☆49Updated last year
- Ukranian NER annotation project☆92Updated 5 months ago
- UNLP 2025 Shared Task on Detecting Social Media Manipulation☆22Updated 2 months ago
- Russian Corpus of Linguistic Acceptability☆46Updated last year
- Extracts parallel corpora from the 2 raw texts in different languages.☆36Updated 2 years ago
- Jupyter notebooks for course "Computational Morphology with HFST".☆19Updated 3 years ago
- Dictionary of obscene words for Ukrainian language☆19Updated 5 months ago
- Morphological Parser for Russian is able to split words into morphemes: prefixes, roots, infixes and postfixes☆17Updated 5 years ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆77Updated 3 years ago
- Curriculum training☆18Updated 3 months ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Updated 6 months ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆71Updated 2 years ago
- ☆139Updated last year
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events☆34Updated last year
- 🖋 Resource and Tool for Writing System Identification -- LREC 2024☆20Updated last year
- UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language☆264Updated last year
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆114Updated last year
- German Morphological Analyzer☆47Updated 3 years ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆23Updated 6 years ago
- Python Finite-State Toolkit☆58Updated this week
- Code and dataset for tracing semantic changes in Russian adjectives☆12Updated 5 years ago
- A multilingual parallel corpus created from translations of the Bible.☆189Updated 5 months ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆32Updated 7 months ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆68Updated 2 years ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences☆33Updated 4 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Updated 2 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆32Updated 3 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆176Updated 4 months ago