babaknaderi / TextComplexityDE
The TextComplexityDE dataset consists of 1000 German sentences with subjective complexity ratings collected from German learners at level B, and 250 sentences with a native speaker's simplification.
☆13Updated 3 years ago
Alternatives and similar repositories for TextComplexityDE:
Users who are interested in TextComplexityDE are comparing it to the repositories listed below
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 9 months ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 3 years ago
- An easy-to-use library to extract indices from texts.☆29Updated 3 years ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Updated last year
- Repository for the Georgetown University Multilayer Corpus (GUM)☆93Updated last month
- Code to create the dataset from "A New Aligned Simple German Corpus"☆10Updated last year
- GSRL is a seq2seq model for end-to-end dependency- and span-based SRL (IJCAI2021).☆18Updated 3 years ago
- A Python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆14Updated 10 months ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- An annotated corpus of argumentative microtexts☆39Updated 2 years ago
- A simple toolkit for conducting analyses using corpus methods☆25Updated 3 years ago
- Klexikon: A German Dataset for Joint Summarization and Simplification☆17Updated 2 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆67Updated 2 years ago
- A package for handy processing of semantic graphs such as AMR, with a special focus on standardized evaluation☆21Updated 6 months ago
- Data and download script to accompany LREC2020 paper "Automated Fact-Checking of Claims from Wikipedia"☆13Updated last year
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15Updated 3 years ago
- Project CARE☆17Updated last year
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆75Updated last year
- Generating claims for zero-shot scientific fact checking☆30Updated 3 years ago
- Extension of the SentenceSimplification project☆59Updated 3 weeks ago
- X-SRL Dataset. Including the code for the SRL annotation projection tool and an out-of-the-box word alignment tool based on Multilingual …☆15Updated 4 years ago
- XL-AMR is a sequence-to-graph cross-lingual AMR parser that exploits transfer learning (EMNLP2020).☆17Updated 9 months ago
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.☆24Updated 9 months ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆65Updated 2 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆179Updated last year
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆52Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- Alignment and annotation for comparable documents.☆22Updated 6 years ago