jantrienes / text-simplification-datasetsLinks
A collection of text simplification datasets and other resources
☆45Updated 9 months ago
Alternatives and similar repositories for text-simplification-datasets
Users that are interested in text-simplification-datasets are comparing it to the libraries listed below
Sorting:
- Neural CRF Model for Sentence Alignment in Text Simplification☆68Updated 5 months ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- Annotation Tool for Text Simplification Corpora☆17Updated last year
- Repository for DISRPT2023 shared task☆17Updated 10 months ago
- Easier Automatic Sentence Simplification Evaluation☆162Updated last year
- Controllable Sentence Simplification with T5☆17Updated 2 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆180Updated last year
- Automated Semantic Analysis of Discourse Markers☆10Updated 3 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…☆10Updated 2 years ago
- a tool for calcualting character n-gram F score☆73Updated 2 years ago
- This repository contains the two datasets introduced in the paper "Making Science Simple: Corpora for the Lay Summarisation of Scientific…☆25Updated last year
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆25Updated last year
- Find informative examples to efficiently (human)-evaluate NLG models.☆11Updated 2 weeks ago
- ☆230Updated 4 years ago
- Appraise code used as part of WMT21 human evaluation campaign☆24Updated 4 months ago
- Official Implementation for Seq2seq is All You Need For Coreference Resolution Paper☆16Updated last year
- Discourse Probing of Pretrained Language Models. In Proceedings of NAACL 2021.☆10Updated 2 years ago
- ☆52Updated 3 years ago
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- ☆15Updated 3 years ago
- Codebase, data and models for the SummaC paper in TACL☆96Updated 4 months ago
- This repository houses the IMPlicature and PRESupposition diagnostic dataset (IMPPRES), consisting of >25k semiautomatically generated se…☆19Updated 3 years ago
- Diagnostic tests for linguistic capacities in language models☆66Updated 3 years ago
- ☆8Updated 2 years ago
- A package for handy processing of semantic graphs such as AMR, with a special focus on standardized evaluation☆24Updated last month
- ☆40Updated last year
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆144Updated 2 years ago
- Multilingual Dialogue Datasets☆19Updated 2 years ago
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago