hetpandya / paraphrase-datasets-pretrained-modelsLinks
A collection of preprocessed datasets and pretrained models for generating paraphrases.
☆31Updated 4 years ago
Alternatives and similar repositories for paraphrase-datasets-pretrained-models
Users that are interested in paraphrase-datasets-pretrained-models are comparing it to the libraries listed below
Sorting:
- Abstractive and Extractive Text summarization using Transformers.☆85Updated 2 years ago
- MobileBERT and DistilBERT for extractive summarization☆91Updated 2 years ago
- Text2Text Language Modeling Toolkit☆301Updated 8 months ago
- Easy to use and understand multiple-choice question generation algorithm using T5 Transformers.☆138Updated 3 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- A simple approach to use GPT2-medium (345M) for generating high quality text summaries with minimal training.☆156Updated 2 years ago
- A tiny BERT for low-resource monolingual models☆31Updated last week
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided☆185Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆86Updated last year
- Fine-tuning GPT-2 Small for Question Answering☆130Updated 2 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆97Updated 2 years ago
- MAFAND-MT☆59Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆73Updated last year
- Zero-shot Transfer Learning from English to Arabic☆30Updated 3 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆56Updated last year
- A python package to augment text data using NLP.☆39Updated 8 months ago
- A web application that interfaces two GEC systems. [web instance is down]☆32Updated last year
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆56Updated last year
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated 2 years ago
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆85Updated 3 months ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆161Updated last year
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆84Updated 5 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.☆79Updated 4 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105Updated 3 years ago
- A Multilingual Replicable Instruction-Following Model☆95Updated 2 years ago
- 💭 Fine-tune a Covid-19 Doctor-like chatbot with GPT2☆51Updated 4 years ago
- NewsQuizQA is a quiz-style question-answer dataset used for generating quiz questions about the news☆35Updated 4 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆104Updated last year