iamarkaj / Split-and-Rephrase
Break long English sentences into simple sentences
☆14Updated last year
Alternatives and similar repositories for Split-and-Rephrase:
Users who are interested in Split-and-Rephrase are comparing it to the repositories listed below
- Identifying complex sentences (with more than 2 clauses), detecting clause breakpoints and converting them to simpler sentences.☆16Updated 5 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆26Updated 4 years ago
- ☆47Updated 9 months ago
- ☆25Updated last year
- "Unsupervised Paraphrase Generation using Pre-trained Language Model."☆22Updated 4 years ago
- ☆23Updated last year
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- OpusFilter - Parallel corpus processing toolkit☆104Updated last month
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆36Updated last year
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- Use a Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆84Updated 5 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆63Updated 11 months ago
- Neural CRF Model for Sentence Alignment in Text Simplification☆67Updated 3 months ago
- Multilingual and monolingual corpora focused on Caucasus languages for Natural Language Processing (NLP)☆35Updated 4 months ago
- VerbNet semantic parser and related utilities☆36Updated 2 years ago
- Efficient Low-Memory Aligner☆143Updated 3 months ago
- Parse Sentences to extract evoked frames.☆10Updated 5 years ago
- ☆12Updated 4 years ago
- Multilingual sentence alignment using sentence embeddings☆116Updated 5 months ago
- Spoken Language assessment☆43Updated 4 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago
- A tool to automatically summarize documents abstractively using the BART or PreSumm Machine Learning Model.☆67Updated 4 years ago
- ☆72Updated last month
- Multilingual Quality Estimation and Automatic Post-editing Dataset☆41Updated 3 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆51Updated 3 months ago
- ☆30Updated 4 years ago
- (yet another not really) awesome topic/text segmentation list☆108Updated 6 years ago
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆22Updated 3 years ago
- Transformer based translation quality estimation☆110Updated last year