jungmaier / dirichlet-smoothed-word-embeddings
Word embeddings from PPMI-weighted and Dirichlet-smoothed co-occurrence matrices
☆10 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for dirichlet-smoothed-word-embeddings
- Repository for DISRPT2023 shared task ☆16 · Updated 3 months ago
- PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran… ☆17 · Updated 2 years ago
- Contextualised Word Representations for Lexical Semantic Change Analysis ☆31 · Updated 4 years ago
- ☆16 · Updated last year
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi… ☆30 · Updated last year
- An initiative to collect and distribute resources for co-reference resolution in a unified standard. ☆24 · Updated 6 months ago
- A package for handy processing of semantic graphs such as AMR, with a special focus on standardized evaluation ☆19 · Updated last month
- Poetry Corpora Annotated on Aesthetic Emotions ☆11 · Updated 2 years ago
- Neural CRF Model for Sentence Alignment in Text Simplification ☆62 · Updated last year
- Organized inventory of research using the Abstract Meaning Representation ☆36 · Updated this week
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages". ☆13 · Updated 3 years ago
- Data Sets and Models for Evaluation of Lexical Semantic Change Detection ☆27 · Updated last year
- ☆15 · Updated 2 years ago
- Code for the neural-Jacana aligner and data for the MultiMWA dataset. ☆20 · Updated last year
- Automated Semantic Analysis of Discourse Markers ☆10 · Updated 2 years ago
- Tool to perform paired evaluation of automatic systems ☆12 · Updated 3 years ago
- Datasets for the task of tracing diachronic semantic shifts in Russian for two large-scale time period pairs (from pre-Soviet to Soviet t… ☆14 · Updated 6 months ago
- A program to choose transfer languages for cross-lingual learning ☆71 · Updated last year
- ☆9 · Updated last year
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT". ☆35 · Updated 2 years ago
- Analyzing mBERT's multilinguality in a small laboratory setting ☆13 · Updated last year
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them ☆37 · Updated 2 years ago
- ParCourE - Parallel Corpus Explorer ☆12 · Updated 2 years ago
- Code and resources for evaluating cross-lingual embedding spaces ☆28 · Updated 4 years ago
- Alignment and annotation for comparable documents. ☆22 · Updated 6 years ago
- A tool for calculating character n-gram F-score ☆67 · Updated last year
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses) ☆63 · Updated last year
- A simple library for querying the URIEL typological database. ☆88 · Updated 7 months ago
- Diagnostic tests for linguistic capacities in language models ☆66 · Updated 2 years ago
- Reference-free MT Evaluation Metrics ☆20 · Updated 2 years ago