facebookresearch / SentAugment

SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in combination with self-training and knowledge-distillation, or for retrieving paraphrases.

☆361

Alternatives and similar repositories for SentAugment

Users who are interested in SentAugment compare it to the libraries listed below.

facebookresearch / MLQA
New dataset
☆306 Updated 3 years ago
uds-lsv / bert-stable-fine-tuning
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
☆137 Updated last year
allenai / naacl2021-longdoc-tutorial
☆345 Updated 4 years ago
gcunhase / NLPMetrics
Python code for various NLP metrics
☆168 Updated 5 years ago
richarddwang / electra_pytorch
Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!)
☆330 Updated last year
JohnGiorgi / DeCLUTR
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…
☆380 Updated 2 years ago
neulab / InterpretEval
Interpretable Evaluation for (Almost) All NLP Tasks
☆195 Updated 2 years ago
bplank / awesome-neural-adaptation-in-NLP
Awesome Neural Adaptation in Natural Language Processing. A curated list. https://arxiv.org/abs/2006.00632
☆266 Updated 4 years ago
facebookresearch / UnsupervisedQA
Unsupervised Question answering via Cloze Translation
☆219 Updated 3 years ago
yueyu1030 / COSINE
[NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…
☆203 Updated 2 years ago
ricardorei / lightning-text-classification
Minimalist implementation of a BERT Sentence Classifier with PyTorch Lightning, Transformers and PyTorch-NLP.
☆218 Updated 2 years ago
robustness-gym / summvis
SummVis is an interactive visualization tool for text summarization.
☆253 Updated 3 years ago
google-research-datasets / tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …
☆310 Updated 5 years ago
Eric-Wallace / interpretability-tutorial-emnlp2020
Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"
☆199 Updated 4 years ago
facebookresearch / vizseq
An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)
☆445 Updated last month
facebookresearch / unlikelihood_training
Neural Text Generation with Unlikelihood Training
☆309 Updated 3 years ago
crux82 / ganbert
Enhancing the BERT training with Semi-supervised Generative Adversarial Networks
☆228 Updated 2 years ago
alexa / dialoglue
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
☆283 Updated 2 years ago
facebookresearch / ELI5
Scripts and links to recreate the ELI5 dataset.
☆326 Updated 3 years ago
facebookresearch / anli
Adversarial Natural Language Inference Benchmark
☆397 Updated 3 years ago
alexa / bort
Repository for the paper "Optimal Subarchitecture Extraction for BERT"
☆473 Updated 3 years ago
google-research-datasets / QED
QED: A Framework and Dataset for Explanations in Question Answering
☆117 Updated 4 years ago
qipeng / golden-retriever
Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation"
☆195 Updated 5 years ago
ShilinHe / interpretableNLP
A list of publications on NLP interpretability (Welcome PR)
☆168 Updated 4 years ago
AndriyMulyar / bert_document_classification
Architectures and pre-trained models for long document classification.
☆155 Updated 4 years ago
google-research-datasets / paws
This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…
☆559 Updated 3 years ago
Guzpenha / transformer_rankers
A library to conduct ranking experiments with transformers.
☆159 Updated 2 years ago
neulab / nn4nlp-concepts
A repository of concepts related to neural networks for NLP
☆454 Updated 5 years ago
zphang / bert_on_stilts
Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs
☆107 Updated 2 years ago
hellohaptik / multi-task-NLP
multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.
☆372 Updated 2 years ago