Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)
☆17Jul 16, 2024Updated last year
Alternatives and similar repositories for boyd-wnut2018
Users that are interested in boyd-wnut2018 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Models and training scripts for the English, German and Russian MAGEC systems described in R. Grundkiewicz, M. Junczys-Dowmunt: Minimally…☆12Jul 7, 2021Updated 4 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Feb 14, 2022Updated 4 years ago
- ☆17Jan 8, 2021Updated 5 years ago
- MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.☆158Sep 27, 2022Updated 3 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆232Mar 24, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction"☆18Dec 21, 2020Updated 5 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆462Mar 26, 2024Updated 2 years ago
- Scorer for grammatical error correction systems.☆14Feb 24, 2016Updated 10 years ago
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model.☆37Apr 6, 2023Updated 3 years ago
- GMEG☆31Nov 21, 2024Updated last year
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11May 7, 2020Updated 6 years ago
- Language model powered proof reader for correcting contextual errors in natural language.☆24Jul 6, 2023Updated 2 years ago
- Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagg…☆964May 21, 2024Updated last year
- Randomly sample lines from massive text files efficiently☆17Apr 1, 2015Updated 11 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Load subtitles into Netflix☆12Mar 6, 2021Updated 5 years ago
- Team Kakao&Brain's Grammatical Error Correction System for the ACL 2019 BEA Shared Task☆93Sep 19, 2019Updated 6 years ago
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆31Apr 8, 2021Updated 5 years ago
- cLang-8 is a dataset for grammatical error correction.☆112Jul 19, 2022Updated 3 years ago
- Predict edit intentions on Wikipedia☆19Jan 24, 2019Updated 7 years ago
- Selective and Recursive local Assembler☆15Jan 31, 2022Updated 4 years ago
- 2018 Duolingo Shared Task on Second Language Acquisition Modeling (SLAM) (http://sharedtask.duolingo.com/)☆12May 31, 2018Updated 7 years ago
- Python tools to retrieve text from CommonCrawl WARC files based on cdx index.☆18Feb 18, 2022Updated 4 years ago
- A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contai…☆105May 6, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A PyTorch implementation of "Reaching Human-level Performance in Automatic Grammatical Error Correction: An Empirical Study"☆50Dec 17, 2018Updated 7 years ago
- The dataset and statistical analysis code released with the submission of EMNLP 2017 paper "Why We Need New Evaluation Metrics for NLG"☆19Nov 16, 2021Updated 4 years ago
- Source code for paper: Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data☆251Jun 3, 2020Updated 5 years ago
- ☆13Mar 1, 2019Updated 7 years ago
- ☆34Jul 4, 2018Updated 7 years ago
- Rust python bindings for symspell☆21Dec 25, 2023Updated 2 years ago
- Dataset of spoken conversational search utterances☆14Aug 27, 2021Updated 4 years ago
- Larger-Context NMT☆13Aug 20, 2017Updated 8 years ago
- Improved version of GECToR☆63Jul 24, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction"☆12Nov 25, 2023Updated 2 years ago
- ☆11Sep 8, 2017Updated 8 years ago
- Code and model files for the paper: "A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction" (AAAI-18…☆184Dec 13, 2018Updated 7 years ago
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆17Jan 23, 2024Updated 2 years ago
- ☆32Jun 16, 2021Updated 4 years ago
- Introduction to pytorch and NLP☆14Sep 22, 2019Updated 6 years ago
- Match tokenized words and phrases within the original, untokenized, often messy, text.☆19Apr 11, 2023Updated 3 years ago