jungmaier / dirichlet-smoothed-word-embeddings
Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices
☆10Updated 4 years ago
Alternatives and similar repositories for dirichlet-smoothed-word-embeddings:
Users that are interested in dirichlet-smoothed-word-embeddings are comparing it to the libraries listed below
- Repository for DISRPT2023 shared task☆17Updated 8 months ago
- PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…☆17Updated 2 years ago
- Poetry Corpora Annotated on Aesthetic Emotions☆11Updated 2 years ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆24Updated 11 months ago
- Automated Semantic Analysis of Discourse Markers☆10Updated 2 years ago
- A package for handy processing of semantic graphs such as AMR, with a special focus on standardized evaluation☆21Updated 6 months ago
- Appraise code used as part of WMT21 human evaluation campaign☆24Updated 2 months ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Updated 3 years ago
- Data Sets and Models for Evaluation of Lexical Semantic Change Detection☆28Updated 2 years ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Updated 3 years ago
- ☆31Updated 3 months ago
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago
- Organized inventory of research using the Abstract Meaning Representation☆37Updated this week
- ☆15Updated 2 years ago
- This repository holds the code for my master thesis entitles "The Association of Gender Bias with BERT - Measuring, Mitigating and Cross-…☆16Updated 2 years ago
- Diagnostic tests for linguistic capacities in language models☆66Updated 2 years ago
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Updated 2 years ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Updated last year
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Updated 2 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- ☆28Updated 10 months ago
- End-to-end shallow discourse parser☆20Updated last year
- A simple library for querying the URIEL typological database.☆89Updated last year
- Model in the loop approach for fig lang generation and explainibilty Code and Data for EMNLP 2022 paper FLUTE: Figurative Language Unders…☆12Updated 2 years ago
- SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages☆8Updated last year
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆22Updated 2 years ago
- ☆23Updated 2 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆65Updated 2 years ago
- Multi-Annotator Competence Estimation tool☆63Updated 5 years ago