hetpandya / paraphrase-datasets-pretrained-modelsLinks
A collection of preprocessed datasets and pretrained models for generating paraphrases.
☆29Updated 4 years ago
Alternatives and similar repositories for paraphrase-datasets-pretrained-models
Users that are interested in paraphrase-datasets-pretrained-models are comparing it to the libraries listed below
Sorting:
- Abstractive and Extractive Text summarization using Transformers.☆84Updated 2 years ago
- MobileBERT and DistilBERT for extractive summarization☆90Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆84Updated last year
- Easy to use and understand multiple-choice question generation algorithm using T5 Transformers.☆135Updated 3 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆73Updated last year
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆97Updated last year
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆32Updated last year
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Zero-shot Transfer Learning from English to Arabic☆30Updated 3 years ago
- A tiny BERT for low-resource monolingual models☆31Updated 9 months ago
- Text2Text Language Modeling Toolkit☆301Updated 6 months ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆103Updated last year
- MAFAND-MT☆57Updated last year
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆160Updated 9 months ago
- ☆103Updated 4 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- Fine-tuning GPT-2 Small for Question Answering☆130Updated 2 years ago
- ☆34Updated 4 years ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆142Updated last month
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆53Updated 4 years ago
- This is a neural spell checker☆66Updated 2 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆68Updated 2 years ago
- Common crawl pretrained sentencepiece tokenizers for English and Japanese for various vocabulary sizes. Also development environment for …☆10Updated 3 years ago
- NewsQuizQA is a quiz-style question-answer dataset used for generating quiz questions about the news☆35Updated 4 years ago
- https://liuzeming01.github.io/XDailyDialog/☆10Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆103Updated 3 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 2 years ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆97Updated last year
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated 2 years ago