facebookresearch / SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in combination with self-training and knowledge-distillation, or for retrieving paraphrases.
☆363Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for SentAugment
- Python code for various NLP metrics☆167Updated 5 years ago
- New dataset☆299Updated 3 years ago
- ☆344Updated 3 years ago
- Pre-Trained Models for ToD-BERT☆291Updated last year
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆132Updated last year
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆281Updated last year
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆325Updated 10 months ago
- A library to conduct ranking experiments with transformers.☆161Updated last year
- architectures and pre-trained models for long document classification.☆154Updated 3 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆470Updated 2 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆193Updated 2 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆201Updated 2 years ago
- Neural Text Generation with Unlikelihood Training☆310Updated 3 years ago
- SummVis is an interactive visualization tool for text summarization.☆251Updated 2 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆379Updated last year
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆433Updated 2 years ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"☆336Updated last week
- QED: A Framework and Dataset for Explanations in Question Answering☆115Updated 3 years ago
- CrossWeigh: Training Named Entity Tagger from Imperfect Annotations☆176Updated 3 months ago
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆106Updated 2 years ago
- ☆181Updated 3 years ago
- A repository of concepts related to neural networks for NLP☆449Updated 4 years ago
- Unsupervised Question answering via Cloze Translation☆218Updated 2 years ago
- Pytorch Implementation of ALBERT(A Lite BERT for Self-supervised Learning of Language Representations)☆225Updated 3 years ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆394Updated 2 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆309Updated last year
- XLNet: fine tuning on RTX 2080 GPU - 8 GB☆154Updated 5 years ago
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆526Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆291Updated 4 years ago