iamarkaj / Split-and-Rephrase
Break long English Sentence into simple sentences
☆14Updated last year
Alternatives and similar repositories for Split-and-Rephrase:
Users that are interested in Split-and-Rephrase are comparing it to the libraries listed below
- Identifying complex sentences (with more than 2 clauses), detecting clause breakpoints and coverting them to simpler sentences.☆16Updated 5 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆26Updated 4 years ago
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆45Updated 2 years ago
- ☆102Updated 3 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆36Updated last year
- coFR: COreference resolution tool for FRench (and singletons).☆24Updated 4 years ago
- "Unsupervised Paraphrase Generation using Pre-trained Language Model."☆22Updated 4 years ago
- Multilingual sentence alignment using sentence embeddings☆113Updated 4 months ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆62Updated 10 months ago
- A tiny BERT for low-resource monolingual models☆31Updated 6 months ago
- Code for the EMNLP 2020 paper titled "Chapter Captor: Text Segmentation in Novels"☆30Updated 4 years ago
- cLang-8 is a dataset for grammatical error correction.☆103Updated 2 years ago
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- ☆72Updated last month
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆44Updated last year
- A Dataset for Direct Quotation Extraction and Attribution in News Articles.☆13Updated 3 years ago
- ☆47Updated 8 months ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- Automated paraphrases Generation☆36Updated 2 years ago
- Parse Sentences to extract evoked frames.☆10Updated 5 years ago
- ☆25Updated last year
- PyTorch implementation of the paper "Dialogue Act Classification with Context-Aware Self-Attention" for dialogue act classification with …☆45Updated last year
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆84Updated 5 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆104Updated last year
- (yet another not really) awesome topic/text segmentation list☆108Updated 6 years ago
- Improved version of GECToR☆60Updated last year
- ☆30Updated 4 years ago
- Spoken Language assessment☆42Updated 4 years ago