LaSTUS-TALN-UPF / TSAR-2022-Shared-TaskView external linksLinks
TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts
☆10Oct 27, 2022Updated 3 years ago
Alternatives and similar repositories for TSAR-2022-Shared-Task
Users that are interested in TSAR-2022-Shared-Task are comparing it to the libraries listed below
Sorting:
- Annotation Tool for Text Simplification Corpora☆16Oct 5, 2023Updated 2 years ago
- ☆25May 11, 2024Updated last year
- ☆29Nov 23, 2021Updated 4 years ago
- ☆12Apr 26, 2020Updated 5 years ago
- This repository contains materials for our tutorial on automatic grammatical error correction: R. Grundkiewicz, C. Bryant, M. Felice: A C…☆38Dec 12, 2020Updated 5 years ago
- A library for evaluation of Grammatical Error Correction (GEC). Accepted to ACL'25 Demo: "gec-metrics: A Unified Library for Grammatical …☆14Jan 25, 2026Updated 3 weeks ago
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…☆10Nov 4, 2022Updated 3 years ago
- ☆10May 26, 2022Updated 3 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆14Aug 6, 2025Updated 6 months ago
- The repository of CLEME (EMNLP 2023) and CLEME2.0 (ACL 2025)☆12May 17, 2025Updated 8 months ago
- ☆10Sep 13, 2022Updated 3 years ago
- [Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring☆10Dec 1, 2023Updated 2 years ago
- GraphOfDocs: Representing multiple documents as a single graph☆21Jun 22, 2022Updated 3 years ago
- Code and dataset for the EMNLP 2024 paper: GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory☆48Sep 26, 2024Updated last year
- ☆15Jul 29, 2024Updated last year
- A series of BERT and Albert model checkpoints trained to reduce gendered correlations in pre-training☆11Oct 22, 2020Updated 5 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.☆11Oct 28, 2022Updated 3 years ago
- This project demonstrates how you can enhance standard CRUD operations in your application using Semantic Search mechanism.☆13Oct 23, 2024Updated last year
- A corpus of short answers written by learners of English and graded with CEFR levels☆12Dec 17, 2021Updated 4 years ago
- Code to create the dataset from "A New Aligned Simple German Corpus☆11Jan 8, 2024Updated 2 years ago
- The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction"☆12Nov 25, 2023Updated 2 years ago
- Building an effective preprocessing tool for African languages☆13Jan 24, 2024Updated 2 years ago
- Python library for backtranslation (with Google Translate)☆12Jan 11, 2020Updated 6 years ago
- ISI tutorials☆12Oct 28, 2016Updated 9 years ago
- The NLPStatTest project☆12Mar 12, 2022Updated 3 years ago
- 基于多层级语言特征融合的中文文本可读性分级模型☆12Feb 27, 2024Updated last year
- MultiLexNorm 2021 competition system from ÚFAL☆15Dec 30, 2021Updated 4 years ago
- Code for paper https://arxiv.org/abs/2501.00522☆14Apr 28, 2025Updated 9 months ago
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated last year
- ☆15Apr 12, 2023Updated 2 years ago
- ☆14Jun 13, 2022Updated 3 years ago
- ☆17Jun 17, 2025Updated 7 months ago
- An implementation of data augmentation methods for natural language processing tasks.☆13Jul 25, 2024Updated last year
- ☆13Jun 7, 2022Updated 3 years ago
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆13Apr 8, 2022Updated 3 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆58Sep 16, 2022Updated 3 years ago
- 中文文本可读性分级数据集☆15Jul 12, 2023Updated 2 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 5 years ago