This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 using a tagged corruption model. The approach and the dataset are described in more detail by Stahlberg and Kumar (2021) (https://www.aclweb.org/anthology/2021.bea-1.4/)
☆163Sep 24, 2024Updated last year
Alternatives and similar repositories for C4_200M-synthetic-dataset-for-grammatical-error-correction
Users that are interested in C4_200M-synthetic-dataset-for-grammatical-error-correction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- cLang-8 is a dataset for grammatical error correction.☆112Jul 19, 2022Updated 3 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆231Mar 24, 2023Updated 3 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆462Mar 26, 2024Updated 2 years ago
- Repository to collect and categorize Grammatical Error Correction papers.☆127Jan 30, 2026Updated 2 months ago
- Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction☆43Jul 2, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Sep 26, 2021Updated 4 years ago
- Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagg…☆962May 21, 2024Updated last year
- ☆120Sep 9, 2020Updated 5 years ago
- This repository contains materials for our tutorial on automatic grammatical error correction: R. Grundkiewicz, C. Bryant, M. Felice: A C…☆38Dec 12, 2020Updated 5 years ago
- Improved version of GECToR☆63Jul 24, 2023Updated 2 years ago
- The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper☆57Mar 4, 2024Updated 2 years ago
- ☆15Mar 15, 2022Updated 4 years ago
- MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.☆158Sep 27, 2022Updated 3 years ago
- The official code of the 2023 ACL paper "Enhancing Grammatical Error Correction Systems with Explanations"☆29Jul 31, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Codes for the paper "Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding" (ACL-IJCNLP 2021)☆41Jun 7, 2021Updated 4 years ago
- Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)☆68Dec 23, 2019Updated 6 years ago
- ☆17Jan 8, 2021Updated 5 years ago
- ☆62Aug 2, 2023Updated 2 years ago
- JFLEG (JHU FLuency-Extended GUG) corpus for Grammatical Error Correction Evaluation☆115Jun 11, 2023Updated 2 years ago
- The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits"☆21Dec 14, 2022Updated 3 years ago
- ☆18Sep 16, 2017Updated 8 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Feb 14, 2022Updated 4 years ago
- GMEG☆31Nov 21, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Source code for paper Grammatical Error Correction in Low-Resource Scenarios (W-NUT 2019)☆13Jun 21, 2022Updated 3 years ago
- Pillars of Grammatical Error Correction: Comprehensive Inspection Of Contemporary Approaches In The Era of Large Language Models☆31Apr 27, 2024Updated last year
- NeuSpell: A Neural Spelling Correction Toolkit☆712Jul 31, 2023Updated 2 years ago
- A web application that interfaces two GEC systems. [web instance is down]☆32Aug 2, 2024Updated last year
- evaluation suite for testing automatic grammatical error corrections☆39Jun 12, 2017Updated 8 years ago
- Convert Standard M2 format to parallel sentences.☆22Jun 20, 2020Updated 5 years ago
- A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Ope…☆1,579Feb 15, 2023Updated 3 years ago
- ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Template☆22Jul 10, 2023Updated 2 years ago
- ☆14Jan 21, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14May 17, 2015Updated 10 years ago
- Neural quality estimation toolkit for grammatical error correction and other language generation applications.☆49Mar 19, 2019Updated 7 years ago
- Code and model files for the paper: "A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction" (AAAI-18…☆184Dec 13, 2018Updated 7 years ago
- ☆20Dec 31, 2020Updated 5 years ago
- ☆129Nov 3, 2022Updated 3 years ago
- Generate multiple choice fill-in-the-blank questions from any article.☆13Dec 8, 2022Updated 3 years ago
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model.☆37Apr 6, 2023Updated 3 years ago