jantrienes / text-simplification-datasetsLinks
A collection of text simplification datasets and other resources
☆51Updated last year
Alternatives and similar repositories for text-simplification-datasets
Users that are interested in text-simplification-datasets are comparing it to the libraries listed below
Sorting:
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 3 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆188Updated 2 years ago
- Neural CRF Model for Sentence Alignment in Text Simplification☆68Updated last year
- Repository for DISRPT2023 shared task☆17Updated last year
- Controllable Sentence Simplification with T5☆18Updated 2 years ago
- Easier Automatic Sentence Simplification Evaluation☆166Updated 2 years ago
- ☆10Updated 3 years ago
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…☆10Updated 3 years ago
- Annotation Tool for Text Simplification Corpora☆16Updated 2 years ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Updated last year
- Conversion scripts for coreference☆28Updated last year
- How to finetune mbart using fairseq☆25Updated 5 years ago
- ☆231Updated 4 years ago
- This repository contains the two datasets introduced in the paper "Making Science Simple: Corpora for the Lay Summarisation of Scientific…☆27Updated last year
- Codebase, data and models for the SummaC paper in TACL☆108Updated last year
- Discourse Probing of Pretrained Language Models. In Proceedings of NAACL 2021.☆10Updated 3 years ago
- ☆55Updated 3 years ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆409Updated last year
- Lexical Substitution Framework☆46Updated 2 years ago
- Utility for behavioral and representational analyses of Language Models☆173Updated this week
- a tool for calcualting character n-gram F score☆77Updated 3 years ago
- A python package of common operations for AMRs☆29Updated 3 years ago
- A simple library for querying the URIEL typological database.☆95Updated last year
- Multilingual Dialogue Datasets☆19Updated 3 years ago
- This repository houses the IMPlicature and PRESupposition diagnostic dataset (IMPPRES), consisting of >25k semiautomatically generated se…☆19Updated 4 years ago
- Automated Semantic Analysis of Discourse Markers☆11Updated 3 years ago
- m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (EMNLP 2022)☆19Updated 2 years ago
- A reading list of up-to-date papers on NLP for Social Good.☆305Updated 2 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆386Updated 2 years ago
- Diagnostic tests for linguistic capacities in language models☆65Updated 3 years ago