hetpandya / paraphrase-datasets-pretrained-modelsLinks
A collection of preprocessed datasets and pretrained models for generating paraphrases.
☆30Updated 4 years ago
Alternatives and similar repositories for paraphrase-datasets-pretrained-models
Users that are interested in paraphrase-datasets-pretrained-models are comparing it to the libraries listed below
Sorting:
- Abstractive and Extractive Text summarization using Transformers.☆85Updated 2 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- Text2Text Language Modeling Toolkit☆301Updated 8 months ago
- A tiny BERT for low-resource monolingual models☆31Updated 11 months ago
- NTREX -- News Test References for MT Evaluation☆85Updated last year
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided☆185Updated 2 years ago
- Zero-shot Transfer Learning from English to Arabic☆30Updated 3 years ago
- Easy to use and understand multiple-choice question generation algorithm using T5 Transformers.☆137Updated 3 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆161Updated 11 months ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆84Updated 5 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆55Updated 5 years ago
- MobileBERT and DistilBERT for extractive summarization☆91Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆97Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆73Updated last year
- Code for extracting parallel corpora from pmindia☆16Updated 5 years ago
- Fine-tuning GPT-2 Small for Question Answering☆130Updated 2 years ago
- DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021☆180Updated 9 months ago
- ☆104Updated 4 years ago
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆85Updated 2 months ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆56Updated last year
- Multilingual abstractive summarization dataset extracted from WikiHow.☆95Updated 6 months ago
- A simple approach to use GPT2-medium (345M) for generating high quality text summaries with minimal training.☆156Updated 2 years ago
- MAFAND-MT☆57Updated last year
- This repository contains a dataset for hate speech detection on social media platforms.☆74Updated 2 years ago
- A python package to augment text data using NLP.☆39Updated 7 months ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆47Updated 2 years ago
- Data labeling using few shot learning GPT-3.☆25Updated 2 years ago
- A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.☆79Updated 4 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆104Updated last year