This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, and word order information for the problem of paraphrase identification.
☆566Jan 4, 2022Updated 4 years ago
Alternatives and similar repositories for paws
Users that are interested in paws are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- syntactically controlled paraphrase networks☆168Dec 30, 2018Updated 7 years ago
- BERT score for text generation☆1,882Jul 30, 2024Updated last year
- New dataset☆311Aug 31, 2021Updated 4 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,927Feb 14, 2023Updated 3 years ago
- A tool for holistic analysis of language generations systems☆471Sep 22, 2025Updated 6 months ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆317May 28, 2020Updated 5 years ago
- LAnguage Model Analysis☆1,390Jul 7, 2024Updated last year
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and T…☆210Oct 20, 2021Updated 4 years ago
- Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions o…☆103Dec 5, 2023Updated 2 years ago
- jiant is an nlp toolkit☆1,674Jul 6, 2023Updated 2 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆652Jan 4, 2023Updated 3 years ago
- Resources for the MRQA 2019 Shared Task☆294Aug 5, 2021Updated 4 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,124Apr 20, 2022Updated 3 years ago
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,153Feb 20, 2024Updated 2 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆789Aug 4, 2023Updated 2 years ago
- Examine two sentences and determine whether they have the same meaning.☆223Feb 5, 2019Updated 7 years ago
- Adversarial Natural Language Inference Benchmark☆399May 12, 2022Updated 3 years ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,257Mar 7, 2024Updated 2 years ago
- Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (DenSPI)☆200Jul 6, 2023Updated 2 years ago
- Shared repository for open-sourced projects from the Google AI Language team.☆1,760Updated this week
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.☆125Jun 3, 2019Updated 6 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,494Jan 14, 2026Updated 2 months ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆194Sep 22, 2025Updated 6 months ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,176May 28, 2023Updated 2 years ago
- Implementation of NeurIPS 19 paper: Paraphrase Generation with Latent Bag of Words☆122Oct 9, 2021Updated 4 years ago
- ☆604Mar 12, 2026Updated last week
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆433Aug 17, 2022Updated 3 years ago
- Phrase-Indexed Question Answering (PIQA)☆93Apr 27, 2019Updated 6 years ago
- A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contai…☆105May 6, 2019Updated 6 years ago
- A paraphrase generator built using the T5 model which produces paraphrased English sentences.☆318Updated this week
- InferSent sentence embeddings☆2,280Aug 30, 2021Updated 4 years ago
- Conditional Transformer Language Model for Controllable Generation☆1,884May 1, 2025Updated 10 months ago
- Evaluating Cross-lingual Sentence Representations☆465Aug 30, 2021Updated 4 years ago
- A python tool for evaluating the quality of sentence embeddings.☆2,106Mar 19, 2024Updated 2 years ago
- ☆178Jul 31, 2020Updated 5 years ago
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation"☆21Feb 23, 2021Updated 5 years ago
- Language-Agnostic SEntence Representations☆3,660May 2, 2024Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,370Mar 23, 2024Updated 2 years ago