iamarkaj / Split-and-Rephrase
Break a long English sentence into simple sentences
☆13Updated last year
Alternatives and similar repositories for Split-and-Rephrase:
Users who are interested in Split-and-Rephrase are comparing it to the repositories listed below:
- Identifying complex sentences (with more than 2 clauses), detecting clause breakpoints, and converting them to simpler sentences.☆16Updated 5 years ago
- Code and models used in "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆26Updated 4 years ago
- Multilingual sentence alignment using sentence embeddings☆108Updated 3 months ago
- ParCourE - Parallel Corpus Explorer☆12Updated 3 years ago
- Code for the EMNLP 2020 paper titled "Chapter Captor: Text Segmentation in Novels"☆30Updated 4 years ago
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …☆75Updated last year
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆18Updated 2 years ago
- Neural CRF Model for Sentence Alignment in Text Simplification☆66Updated last month
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"☆25Updated 2 years ago
- Multilingual and monolingual corpora focused on Caucasus languages, for Natural Language Processing (NLP)☆34Updated 2 months ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆206Updated 6 months ago
- Sequence tagger based on BERT☆20Updated 2 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- Efficient Low-Memory Aligner☆141Updated last month
- Bilingual term extractor☆53Updated last year
- Improving Unsupervised Dialogue Topic Segmentation with Utterance-Pair Coherence Scoring☆61Updated last year
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- State of the art complex word identification models.☆13Updated 5 years ago
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum…☆44Updated last year
- "Unsupervised Paraphrase Generation using Pre-trained Language Model."☆22Updated 4 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆36Updated last year
- A tiny BERT for low-resource monolingual models☆31Updated 4 months ago