iamarkaj / Split-and-RephraseLinks
Break long English Sentence into simple sentences
☆14Updated 2 years ago
Alternatives and similar repositories for Split-and-Rephrase
Users that are interested in Split-and-Rephrase are comparing it to the libraries listed below
Sorting:
- Identifying complex sentences (with more than 2 clauses), detecting clause breakpoints and coverting them to simpler sentences.☆16Updated 5 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆249Updated 2 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆217Updated 11 months ago
- Multilingual sentence alignment using sentence embeddings☆120Updated 8 months ago
- A sentence segmenter that actually works!☆306Updated 4 years ago
- ☆103Updated 4 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- implement Chinese Segmentation with a Word-Based Perceptron Algorithm☆8Updated 3 years ago
- Punctuation restoration and spell correction experiments.☆251Updated 4 years ago
- Text and Punctuation correction with Deep Learning☆128Updated 5 years ago
- Improving Unsupervised Dialogue Topic Segmentation with Utterance-Pair Coherence Scoring☆64Updated last year
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆160Updated 9 months ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆50Updated last year
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f…☆220Updated 8 months ago
- Text2Text Language Modeling Toolkit☆301Updated 6 months ago
- cLang-8 is a dataset for grammatical error correction.☆106Updated 3 years ago
- Improved Sentence Alignment in Linear Time and Space☆175Updated 2 years ago
- Automatic Detection of Potentially Idiomatic Expressions☆12Updated 4 years ago
- ColBERT humor dataset for the task of humor detection, containing 200,000 jokes/news☆72Updated 9 months ago
- Easy to use and understand multiple-choice question generation algorithm using T5 Transformers.☆135Updated 3 years ago
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆181Updated 6 years ago
- Automated paraphrases Generation☆36Updated 2 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆152Updated 4 years ago
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided☆186Updated 2 years ago
- simple rule based named entity recognition☆42Updated 3 years ago
- Zero-shot Transfer Learning from English to Arabic☆30Updated 3 years ago
- (yet another not really) awesome topic/text segmentation list☆109Updated 6 years ago
- OpusFilter - Parallel corpus processing toolkit☆105Updated 2 weeks ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆84Updated 5 years ago
- Improved version of GECToR☆59Updated last year