facebookresearch / SentAugmentLinks
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in combination with self-training and knowledge-distillation, or for retrieving paraphrases.
☆362Updated 3 years ago
Alternatives and similar repositories for SentAugment
Users that are interested in SentAugment are comparing it to the libraries listed below
Sorting:
- ☆346Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- Neural Text Generation with Unlikelihood Training☆309Updated 3 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆330Updated last year
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆380Updated 2 years ago
- SummVis is an interactive visualization tool for text summarization.☆253Updated 3 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆557Updated 3 years ago
- Unsupervised Question answering via Cloze Translation☆219Updated 3 years ago
- XLNet: fine tuning on RTX 2080 GPU - 8 GB☆154Updated 5 years ago
- ☆323Updated 2 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆136Updated last year
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆203Updated 3 years ago
- Enhancing the BERT training with Semi-supervised Generative Adversarial Networks☆229Updated 2 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆473Updated 3 years ago
- Pre-Trained Models for ToD-BERT☆293Updated last year
- The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".☆433Updated 11 months ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆283Updated last year
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆313Updated last year
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆532Updated 3 years ago
- An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)☆445Updated this week
- IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models☆179Updated 4 years ago
- Python code for various NLP metrics☆167Updated 5 years ago
- Create interactive textual heat maps for Jupiter notebooks☆196Updated last year
- Document ranking via sentence modeling using BERT☆144Updated 2 years ago
- FastFormers - highly efficient transformer models for NLU☆705Updated 3 months ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model"☆190Updated last month
- Question Answering using Albert and Electra☆207Updated 2 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆203Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Updated 3 years ago