sigtyp / ST2024Links
SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages
☆9Updated last year
Alternatives and similar repositories for ST2024
Users that are interested in ST2024 are comparing it to the libraries listed below
Sorting:
- Datasets for the Monolingual Word Sense Alignment (MWSA) task☆12Updated 4 years ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Updated last year
- Datasets for the task of tracing diachronic semantic shifts in Russian for two large-scale time period pairs (from pre-Soviet to Soviet t…☆14Updated 5 months ago
- Compiled tools, datasets, and other resources for historical text normalization.☆19Updated 6 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆26Updated 2 years ago
- ☆49Updated last year
- A tool that locates, downloads, and extracts machine translation corpora☆156Updated 2 months ago
- OpusFilter - Parallel corpus processing toolkit☆109Updated this week
- Efficient Low-Memory Aligner☆146Updated 6 months ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer☆20Updated 5 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆101Updated last year
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.☆10Updated 4 years ago
- ☆65Updated last year
- Easier Automatic Sentence Simplification Evaluation☆161Updated last year
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated 2 years ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆25Updated last year
- The Benchmark of Linguistic Minimal Pairs☆152Updated 2 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆373Updated last year
- Repository for DISRPT2023 shared task☆17Updated last year
- 🖋 Resource and Tool for Writing System Identification -- LREC 2024☆19Updated last year
- Poetry Corpora Annotated on Aesthetic Emotions☆11Updated 3 years ago
- ☆74Updated 4 months ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆66Updated this week
- Repository for the Georgetown University Multilayer Corpus (GUM)☆98Updated last week
- Linguistic and stylistic complexity measures for (literary) texts☆82Updated last year
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆223Updated 2 years ago
- ☆44Updated 3 years ago
- ☆24Updated 4 years ago
- Diagnostic tests for linguistic capacities in language models☆66Updated 3 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆158Updated last year