hetpandya / paraphrase-datasets-pretrained-models
A collection of preprocessed datasets and pretrained models for generating paraphrases.
☆31 · Updated 4 years ago
Alternatives and similar repositories for paraphrase-datasets-pretrained-models
Users who are interested in paraphrase-datasets-pretrained-models are comparing it to the repositories listed below.
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish. ☆59 · Updated 3 years ago
- Abstractive and Extractive Text summarization using Transformers. ☆86 · Updated 2 years ago
- A tiny BERT for low-resource monolingual models ☆31 · Updated 2 months ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases". ☆100 · Updated 2 years ago
- Fine-tuning GPT-2 Small for Question Answering ☆130 · Updated 2 years ago
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided ☆186 · Updated 2 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection. ☆44 · Updated 2 years ago
- MobileBERT and DistilBERT for extractive summarization ☆92 · Updated 2 years ago
- Easy to use and understand multiple-choice question generation algorithm using T5 Transformers. ☆138 · Updated 3 years ago
- This repository contains a dataset for hate speech detection on social media platforms. ☆74 · Updated 3 years ago
- NTREX -- News Test References for MT Evaluation ☆86 · Updated last year
- Zero-shot Transfer Learning from English to Arabic ☆30 · Updated 3 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al. ☆98 · Updated 2 years ago
- ☆105 · Updated 4 years ago
- Text2Text Language Modeling Toolkit ☆304 · Updated 10 months ago
- NewsQuizQA is a quiz-style question-answer dataset used for generating quiz questions about the news ☆35 · Updated 4 years ago
- MAFAND-MT ☆60 · Updated last year
- Code for extracting parallel corpora from pmindia ☆16 · Updated 5 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro… ☆161 · Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 · Updated 3 years ago
- A simple approach to use GPT2-medium (345M) for generating high quality text summaries with minimal training. ☆156 · Updated 2 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc… ☆58 · Updated last year
- A Benchmark Dataset for Understanding Disfluencies in Question Answering ☆64 · Updated 4 years ago
- This is a neural spelling checker ☆69 · Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language ☆74 · Updated last year
- Benchmarking various Deep Learning models such as BERT, ALBERT, BiLSTMs on the task of sentence entailment using two datasets - MultiNLI … ☆28 · Updated 4 years ago
- Quality Controlled Paraphrase Generation (ACL 2022) ☆71 · Updated 2 months ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex… ☆56 · Updated 5 years ago
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation ☆89 · Updated 5 months ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0. ☆105 · Updated 3 years ago