sigtyp / ST2024Links

SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages

☆9

Alternatives and similar repositories for ST2024

Users that are interested in ST2024 are comparing it to the libraries listed below

Sorting:

elexis-eu / MWSA
Datasets for the Monolingual Word Sense Alignment (MWSA) task
☆12Updated 4 years ago
naverlabseurope / ALPS2024-MT-LAB
CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022
☆13Updated last year
juliarodina / RuSemShift
Datasets for the task of tracing diachronic semantic shifts in Russian for two large-scale time period pairs (from pre-Soviet to Soviet t…
☆14Updated 5 months ago
coastalcph / histnorm
Compiled tools, datasets, and other resources for historical text normalization.
☆19Updated 6 years ago
sigmorphon / 2022SegmentationST
SIGMORPHON 2022 Shared Task on Morpheme Segmentation
☆26Updated 2 years ago
rewicks / ersatz
☆49Updated last year
thammegowda / mtdata
A tool that locates, downloads, and extracts machine translation corpora
☆156Updated 2 months ago
Helsinki-NLP / OpusFilter
OpusFilter - Parallel corpus processing toolkit
☆109Updated this week
robertostling / eflomal
Efficient Low-Memory Aligner
☆146Updated 6 months ago
antonisa / inflection
Morphological Inflection for Low-Resource Languages using cross-lingual transfer
☆20Updated 5 years ago
thompsonb / prism
MT Evaluation in Many Languages via Zero-Shot Paraphrasing
☆101Updated last year
SapienzaNLP / clubert
Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.
☆10Updated 4 years ago
jonorthwash / ud-annotatrix
☆65Updated last year
feralvam / easse
Easier Automatic Sentence Simplification Evaluation
☆161Updated last year
andreasvc / dutchcoref
Dutch coreference resolution & dialogue analysis using deterministic rules
☆21Updated 2 years ago
UniversalAnaphora / UniversalAnaphora
An initiative to collect and distribute resources for co-reference resolution in a unified standard.
☆25Updated last year
alexwarstadt / blimp
The Benchmark of Linguistic Minimal Pairs
☆152Updated 2 years ago
cisnlp / simalign
Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)
☆373Updated last year
disrpt / sharedtask2023
Repository for DISRPT2023 shared task
☆17Updated last year
cisnlp / GlotScript
🖋 Resource and Tool for Writing System Identification -- LREC 2024
☆19Updated last year
tnhaider / poetry-emotion
Poetry Corpora Annotated on Aesthetic Emotions
☆11Updated 3 years ago
Helsinki-NLP / OpusTools
☆74Updated 4 months ago
nert-nlp / streusle
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)
☆66Updated this week
amir-zeldes / gum
Repository for the Georgetown University Multilayer Corpus (GUM)
☆98Updated last week
tsproisl / textcomplexity
Linguistic and stylistic complexity measures for (literary) texts
☆82Updated last year
Hyperparticle / udify
A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…
☆223Updated 2 years ago
AmericasNLP / americasnlp2021
☆44Updated 3 years ago
tsamardzic / nonstandard
☆24Updated 4 years ago
aetting / lm-diagnostics
Diagnostic tests for linguistic capacities in language models
☆66Updated 3 years ago
bitextor / bicleaner
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
☆158Updated last year