sigtyp / ST2024
SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages
☆7 · Updated 9 months ago
Related projects
Alternatives and complementary repositories for ST2024
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022 ☆14 · Updated 7 months ago
- Datasets for the Monolingual Word Sense Alignment (MWSA) task ☆12 · Updated 3 years ago
- Repository for DISRPT2023 shared task ☆16 · Updated 3 months ago
- Datasets for the task of tracing diachronic semantic shifts in Russian for two large-scale time period pairs (from pre-Soviet to Soviet t… ☆14 · Updated 6 months ago
- Compiled tools, datasets, and other resources for historical text normalization. ☆16 · Updated 5 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆24 · Updated last year
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer ☆20 · Updated 4 years ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard. ☆24 · Updated 5 months ago
- The CODWOE shared task invites you to compare two types of semantic descriptions: dictionary glosses and word embedding representations. … ☆11 · Updated 2 years ago
- The Benchmark of Linguistic Minimal Pairs ☆141 · Updated last year
- OpusFilter - Parallel corpus processing toolkit ☆102 · Updated 2 months ago
- ParCourE - Parallel Corpus Explorer ☆12 · Updated 2 years ago
- Alignment and annotation for comparable documents. ☆22 · Updated 6 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆81 · Updated last month
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish. ☆10 · Updated 3 years ago
- A survey of corpora for Germanic low-resource languages and dialects ☆24 · Updated 3 months ago
- Contextualised Word Representations for Lexical Semantic Change Analysis ☆31 · Updated 4 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆36 · Updated last year
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT". ☆35 · Updated 2 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018) ☆12 · Updated 3 months ago
- Python version for Doug Biber's Multidimensional Analysis (MDA) ☆27 · Updated 4 months ago
- Data for the HIPE 2022 shared task. ☆15 · Updated 11 months ago
- Curriculum training ☆16 · Updated last month
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them ☆37 · Updated 2 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules ☆21 · Updated last year