facebookresearch / SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in combination with self-training and knowledge-distillation, or for retrieving paraphrases.
☆362Updated 3 years ago
Alternatives and similar repositories for SentAugment:
Users that are interested in SentAugment are comparing it to the libraries listed below
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆327Updated last year
- ☆345Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆135Updated last year
- New dataset☆303Updated 3 years ago
- Pre-Trained Models for ToD-BERT☆292Updated last year
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆472Updated 2 years ago
- Neural Text Generation with Unlikelihood Training☆309Updated 3 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆380Updated last year
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆202Updated 2 years ago
- SummVis is an interactive visualization tool for text summarization.☆252Updated 2 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆557Updated 3 years ago
- Interpretable Evaluation for AI Systems☆363Updated 2 years ago
- Unsupervised Question answering via Cloze Translation☆219Updated 2 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation"☆195Updated 5 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆394Updated 3 years ago
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆106Updated 2 years ago
- Create interactive textual heat maps for Jupiter notebooks☆196Updated 10 months ago
- FastFormers - highly efficient transformer models for NLU☆704Updated last week
- Awesome Neural Adaptation in Natural Language Processing. A curated list. https://arxiv.org/abs/2006.00632☆265Updated 3 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆603Updated 2 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago
- Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model"☆189Updated 2 years ago
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT☆231Updated last year
- A repository of concepts related to neural networks for NLP☆452Updated 5 years ago
- Enhancing the BERT training with Semi-supervised Generative Adversarial Networks