hetpandya / paraphrase-datasets-pretrained-models
A collection of preprocessed datasets and pretrained models for generating paraphrases.
☆29Updated 3 years ago
Alternatives and similar repositories for paraphrase-datasets-pretrained-models:
Users that are interested in paraphrase-datasets-pretrained-models are comparing it to the libraries listed below
- Abstractive and Extractive Text summarization using Transformers.☆83Updated last year
- A python package to augment text data using NLP.☆40Updated 2 months ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- NewsQuizQA is a quiz-style question-answer dataset used for generating quiz questions about the news☆34Updated 4 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided☆187Updated last year
- A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.☆78Updated 3 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆39Updated last year
- MobileBERT and DistilBERT for extractive summarization☆89Updated last year
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆146Updated last year
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆54Updated 8 months ago
- Hinglish Text Classification☆30Updated last year
- ☆32Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆71Updated last year
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆54Updated 4 years ago
- NTREX -- News Test References for MT Evaluation☆82Updated 10 months ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆90Updated last month
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆32Updated 11 months ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)☆52Updated 2 years ago
- How to finetune mbart using fairseq☆24Updated 4 years ago
- Use-cases of Hugging Face's BERT (e.g. paraphrase generation, unsupervised extractive summarization).☆20Updated 5 years ago
- Japanese Sentence Summarization with BERT☆49Updated last year
- A library of translation-based text similarity measures☆25Updated last year
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages☆46Updated 2 years ago
- "Unsupervised Paraphrase Generation using Pre-trained Language Model."☆22Updated 4 years ago
- Code for extracting parallel corpora from pmindia☆16Updated 5 years ago