babaknaderi / TextComplexityDELinks
TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learners in level B, and 250 sentences with a native speaker's simplification.
โ13Updated 3 years ago
Alternatives and similar repositories for TextComplexityDE
Users that are interested in TextComplexityDE are comparing it to the libraries listed below
Sorting:
- Repository for the Georgetown University Multilayer Corpus (GUM)โ99Updated last week
- Sentiment Corpus for Swedish ๐ธ๐ช Norwegian ๐ณ๐ด Danish ๐ฉ๐ฐ Finnish ๐ซ๐ฎ (and English ๐ด๓ ง๓ ข๓ ฅ๓ ฎ๓ ง๓ ฟ)โ15Updated 4 years ago
- This is a simple Python package for calculating a variety of lexical diversity indicesโ79Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapersโ38Updated 3 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rulesโ22Updated 2 years ago
- Code to create the dataset from "A New Aligned Simple German Corpusโ11Updated last year
- CONLL-U to Pandas DataFrameโ31Updated 7 years ago
- Poetry Corpora Annotated on Aesthetic Emotionsโ12Updated 3 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doโฆโ82Updated last year
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheniโฆโ12Updated last year
- UIMA CAS processing library written in Pythonโ90Updated 3 months ago
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).โ14Updated last year
- Python tools for interacting with Wikidataโ154Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidataโ94Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) textsโ84Updated last year
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.โ69Updated 4 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.โ19Updated 2 years ago
- Annotation tool for coreferenceโ33Updated 2 years ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022โ13Updated last year
- A module to compute textual lexical richness (aka lexical diversity).โ111Updated 2 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"โ18Updated 4 years ago
- Data for the HIPE 2022 shared task.โ21Updated last year
- โ64Updated 2 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.โ152Updated last week
- University of Colorado VerbNetโ114Updated last year
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.โ363Updated 2 years ago
- An easy-to-use library to extract indices from texts.โ29Updated 4 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissionsโ19Updated 2 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearchโ70Updated 3 years ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEvalโ13โ193Updated last month