SapienzaNLP / xl-wsd-code
Code to train and test Word Sense Disambiguation models based on different pretrained transformers.
☆13Updated 2 years ago
Related projects: ⓘ
- Experiment code for the ACL 2020 paper "Moving Down the Long Tail of Word Sense Disambiguation with Gloss Informed Bi-encoders".☆50Updated last year
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 2 years ago
- Codebase for probing and visualizing multilingual models.☆45Updated 4 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆54Updated 2 years ago
- ☆16Updated 3 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆35Updated last year
- State of the art complex word identification models.☆13Updated 4 years ago
- ☆32Updated 3 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- ☆28Updated 3 months ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆80Updated 3 weeks ago
- Pretraining scripts for BART transformer model☆11Updated last year
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer☆39Updated 4 years ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆20Updated last year
- GMEG☆29Updated 2 years ago
- Lexical Simplification with Pretrained Encoders☆69Updated 3 years ago
- Coreference Resolution With Entity Equalization☆40Updated last year
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆62Updated 4 years ago
- ☆14Updated 2 years ago
- ☆21Updated 3 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆30Updated last year
- Terminology Dataset☆22Updated 4 years ago
- XL-AMR is a sequence-to-graph cross-lingual AMR parser that exploits transfer learning (EMNLP2020).☆16Updated last month
- ☆16Updated 3 years ago
- Massively Multilingual Transfer for NER☆85Updated 2 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆40Updated 9 months ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆66Updated 3 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆13Updated 4 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year