google-research-datasets / wiki-atomic-edits
A dataset of atomic Wikipedia edits, each an insertion or deletion of a contiguous chunk of text in a sentence. This dataset contains ~43 million edits across 8 languages.
☆105May 6, 2019Updated 6 years ago
Alternatives and similar repositories for wiki-atomic-edits
Users who are interested in wiki-atomic-edits are comparing it to the repositories listed below.
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.☆123Jun 3, 2019Updated 6 years ago
- An example of DyNet autobatching for the NIPS "how to code a paper" workshop☆12Dec 9, 2017Updated 8 years ago
- Predict edit intentions on Wikipedia☆19Jan 24, 2019Updated 7 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)☆17Jul 16, 2024Updated last year
- Automatic extraction of edited sentences from text edition histories.☆83Feb 14, 2022Updated 4 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Mar 16, 2021Updated 4 years ago
- 25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural languag…☆85Oct 9, 2018Updated 7 years ago
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆16Nov 5, 2024Updated last year
- GMEG☆31Nov 21, 2024Updated last year
- The Rainbow Parser☆17Mar 5, 2018Updated 7 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆561Jan 4, 2022Updated 4 years ago
- ☆48Jun 8, 2020Updated 5 years ago
- Performance Prediction for NLP Tasks☆17May 5, 2020Updated 5 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆20Feb 22, 2021Updated 4 years ago
- Legacy version of CNN neural net toolkit (now called dynet)☆19Oct 8, 2016Updated 9 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆458Mar 26, 2024Updated last year
- syntactically controlled paraphrase networks☆168Dec 30, 2018Updated 7 years ago
- Supporting example for "A Rust SentencePiece implementation"☆20Jun 7, 2020Updated 5 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 5 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆58Sep 16, 2022Updated 3 years ago
- This is Grammarly's Yahoo Answers Formality Corpus☆108Jul 7, 2025Updated 7 months ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Language model powered proof reader for correcting contextual errors in natural language.☆24Jul 6, 2023Updated 2 years ago
- Team Kakao&Brain's Grammatical Error Correction System for the ACL 2019 BEA Shared Task☆92Sep 19, 2019Updated 6 years ago
- Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.☆1,155Feb 20, 2024Updated last year
- Neural Text Generation with Unlikelihood Training☆310Aug 31, 2021Updated 4 years ago
- Distillation of Ensemble Dependency Parsers into a Single Graph-Based Parser☆11Oct 14, 2016Updated 9 years ago
- TREC Core track☆11Jul 5, 2017Updated 8 years ago
- ☆10Apr 20, 2016Updated 9 years ago
- C++ implementation of HPYLM☆11May 2, 2017Updated 8 years ago
- ☆94Feb 13, 2024Updated 2 years ago
- A dataset of sentences with ordinal labels for grammaticality☆29Jun 9, 2014Updated 11 years ago
- 👿→😈☆25Dec 19, 2017Updated 8 years ago
- COLING 2018 Tutorial on Multilingual FrameNet: Automatic semantic role labeling for FrameNet☆25Aug 29, 2018Updated 7 years ago
- Cascaded Text Generation with Markov Transformers☆130Mar 20, 2023Updated 2 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆98May 12, 2020Updated 5 years ago
- NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks☆71May 22, 2023Updated 2 years ago
- DeNSe parser in Dependency Parsing as Head Selection (EACL 2017) https://arxiv.org/abs/1606.01280☆25Apr 27, 2017Updated 8 years ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 2 years ago