dennlinger / klexikon
Klexikon: A German Dataset for Joint Summarization and Simplification
☆16Updated last year
Related projects: ⓘ
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆66Updated 3 years ago
- ☆13Updated 3 years ago
- A survey of corpora for Germanic low-resource languages and dialects☆24Updated last month
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆14Updated 5 months ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆55Updated last year
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆25Updated 4 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆80Updated 3 weeks ago
- ☆23Updated 4 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆54Updated 2 years ago
- Lexical Substitution Framework☆45Updated last year
- Lexical Simplification with Pretrained Encoders☆69Updated 3 years ago
- CONLL-U to Pandas DataFrame☆30Updated 6 years ago
- a tool for calcualting character n-gram F score☆65Updated last year
- ☆60Updated 7 months ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆63Updated last year
- A program to choose transfer languages for cross-lingual learning☆70Updated last year
- BERT and ELECTRA models trained on Europeana Newspapers☆35Updated 2 years ago
- ☆24Updated 4 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆80Updated 3 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆96Updated last year
- ☆64Updated last year
- Alignment and annotation for comparable documents.☆22Updated 5 years ago
- ☆15Updated last year
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Updated 2 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Repository for DISRPT2023 shared task☆16Updated last month
- MultiLexNorm 2021 competition system from ÚFAL☆15Updated 2 years ago
- A software for transferring pre-trained English models to foreign languages☆18Updated last year
- Automatically detect errors in annotated corpora.☆45Updated last year