hetpandya / paraphrase-datasets-pretrained-modelsLinks
A collection of preprocessed datasets and pretrained models for generating paraphrases.
☆29Updated 3 years ago
Alternatives and similar repositories for paraphrase-datasets-pretrained-models
Users that are interested in paraphrase-datasets-pretrained-models are comparing it to the libraries listed below
Sorting:
- Zero-shot Transfer Learning from English to Arabic☆29Updated 3 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆97Updated last year
- A python package to augment text data using NLP.☆39Updated 4 months ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- NewsQuizQA is a quiz-style question-answer dataset used for generating quiz questions about the news☆34Updated 4 years ago
- Abstractive and Extractive Text summarization using Transformers.☆84Updated 2 years ago
- Benchmarking various Deep Learning models such as BERT, ALBERT, BiLSTMs on the task of sentence entailment using two datasets - MultiNLI …☆28Updated 4 years ago
- "Unsupervised Paraphrase Generation using Pre-trained Language Model."☆22Updated 4 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆53Updated 4 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆36Updated 2 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 4 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆54Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆92Updated 3 months ago
- Common crawl pretrained sentencepiece tokenizers for English and Japanese for various vocabulary sizes. Also development environment for …☆10Updated 3 years ago
- FRAKE: Fusional Real-time Automatic Keyword Extraction☆21Updated last year
- ☆34Updated 4 years ago
- A question-answering dataset with a focus on subjective information☆45Updated last year
- ☆38Updated 2 years ago
- A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.☆79Updated 3 years ago
- simple rule based named entity recognition☆42Updated 3 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆60Updated 4 years ago
- Use-cases of Hugging Face's BERT (e.g. paraphrase generation, unsupervised extractive summarization).☆20Updated 5 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- Quality Controlled Paraphrase Generation (ACL 2022)☆70Updated last month
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆63Updated last year
- An example of multilingual machine translation using a pretrained version of mt5 from Hugging Face.☆42Updated 4 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆72Updated last year