iamarkaj / Split-and-Rephrase
Break long English Sentence into simple sentences
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Split-and-Rephrase
- Identifying complex sentences (with more than 2 clauses), detecting clause breakpoints and coverting them to simpler sentences.☆16Updated 4 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆25Updated 4 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆97Updated last year
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆204Updated 3 months ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- A tiny BERT for low-resource monolingual models☆29Updated last month
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆48Updated last year
- Multilingual sentence alignment using sentence embeddings☆97Updated this week
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆62Updated 5 months ago
- coFR: COreference resolution tool for FRench (and singletons).☆24Updated 4 years ago
- Text and Punctuation correction with Deep Learning☆129Updated 4 years ago
- "Unsupervised Paraphrase Generation using Pre-trained Language Model."☆23Updated 4 years ago
- Easier Automatic Sentence Simplification Evaluation☆158Updated last year
- simple rule based named entity recognition☆43Updated 2 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆35Updated last year
- ☆15Updated 2 years ago
- Named Entity Recognition with Pretrained XLM-RoBERTa☆87Updated 3 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆230Updated 2 years ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆80Updated 5 years ago
- ParCourE - Parallel Corpus Explorer☆12Updated 2 years ago
- Caucasus languages focused multilingual and monolingual corpuses for Natural Language Processing(NLP)☆31Updated last week
- ☆22Updated 11 months ago
- Extension of the SentenceSimplification project☆55Updated this week
- Lexical Simplification with Pretrained Encoders☆69Updated 3 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆49Updated 4 years ago
- A accurate multilingual word aligner based on LaBSE☆18Updated last year
- Bilingual term extractor☆52Updated 10 months ago
- Efficient Low-Memory Aligner☆137Updated 2 months ago
- The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper☆55Updated 8 months ago