hetpandya / paraphrase-datasets-pretrained-models
A collection of preprocessed datasets and pretrained models for generating paraphrases.
☆29 Updated 3 years ago
Alternatives and similar repositories for paraphrase-datasets-pretrained-models:
Users who are interested in paraphrase-datasets-pretrained-models compare it to the libraries listed below.
- Zero-shot Transfer Learning from English to Arabic ☆29 Updated 2 years ago
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided ☆188 Updated last year
- NewsQuizQA is a quiz-style question-answer dataset used for generating quiz questions about the news ☆34 Updated 4 years ago
- A tiny BERT for low-resource monolingual models ☆31 Updated 5 months ago
- A python package to augment text data using NLP. ☆40 Updated 3 weeks ago
- Abstractive and Extractive Text summarization using Transformers. ☆82 Updated last year
- "Unsupervised Paraphrase Generation using Pre-trained Language Model." ☆22 Updated 4 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish. ☆59 Updated 2 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization. ☆34 Updated 2 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences ☆62 Updated 9 months ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases". ☆98 Updated 2 years ago
- Code for ProtAugment: Unsupervised diverse short-texts paraphrasing for intent detection meta-learning ☆21 Updated 2 years ago
- Quality Controlled Paraphrase Generation (ACL 2022) ☆70 Updated 2 years ago
- GupShup: Summarizing Open-Domain Code-Switched Conversations EMNLP 2021 ☆15 Updated 3 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al. ☆96 Updated last year
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex… ☆51 Updated 4 years ago
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures. ☆32 Updated 9 months ago
- MobileBERT and DistilBERT for extractive summarization ☆88 Updated last year
- A Benchmark Dataset for Understanding Disfluencies in Question Answering ☆62 Updated 3 years ago
- A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense. ☆78 Updated 3 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations ☆55 Updated 2 years ago
- MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization ☆70 Updated 3 years ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021) ☆52 Updated 2 years ago
- Code for extracting parallel corpora from pmindia ☆16 Updated 5 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020… ☆33 Updated 3 years ago
- A web application that interfaces two GEC systems. [web instance is down] ☆31 Updated 7 months ago