facebookresearch / SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in combination with self-training and knowledge-distillation, or for retrieving paraphrases.
☆361 Updated 3 years ago
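The retrieval idea in the description above can be illustrated with a small, self-contained sketch. This is not SentAugment's code: the bag-of-words `embed` function, the `cosine` helper, the `retrieve` function, and the four-sentence bank are all hypothetical stand-ins chosen so the example runs with the standard library only. A real system would presumably use a learned sentence encoder and an approximate nearest-neighbor index over a much larger bank.

```python
import math
from collections import Counter


def embed(sentence):
    """Toy stand-in for a sentence encoder: a bag-of-words count vector."""
    return Counter(sentence.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, bank, k=2):
    """Return the k sentences in the bank most similar to the query."""
    q = embed(query)
    scored = [(cosine(q, embed(s)), s) for s in bank]
    # Highest similarity first; the sort is stable, so ties keep bank order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [s for _, s in scored[:k]]


# Hypothetical sentence bank; a real bank would hold far more sentences.
bank = [
    "the movie was great and the acting was superb",
    "stock prices fell sharply on monday",
    "a great movie with superb acting",
    "the weather is sunny today",
]

neighbors = retrieve("great movie superb acting", bank, k=2)
for sentence in neighbors:
    print(sentence)
```

In a self-training or knowledge-distillation setup, retrieved neighbors like these would be labeled by a teacher model and added to the training set; that step is omitted here.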
Alternatives and similar repositories for SentAugment
Users who are interested in SentAugment are comparing it to the libraries listed below.
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!) ☆330 Updated last year
- Enhancing the BERT training with Semi-supervised Generative Adversarial Networks ☆228 Updated 2 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines ☆137 Updated 2 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o… ☆380 Updated 2 years ago
- New dataset ☆308 Updated 4 years ago
- ☆345 Updated 4 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self… ☆205 Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks ☆195 Updated last month
- SummVis is an interactive visualization tool for text summarization. ☆253 Updated 3 years ago
- Minimalist implementation of a BERT Sentence Classifier with PyTorch Lightning, Transformers and PyTorch-NLP. ☆219 Updated 2 years ago
- Awesome Neural Adaptation in Natural Language Processing. A curated list. https://arxiv.org/abs/2006.00632 ☆264 Updated 4 years ago
- An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.) ☆446 Updated 4 months ago
- Scripts and links to recreate the ELI5 dataset. ☆326 Updated 4 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT" ☆470 Updated 3 years ago
- Unsupervised Question answering via Cloze Translation ☆219 Updated 3 years ago
- Python code for various NLP metrics ☆169 Updated 6 years ago
- Neural Text Generation with Unlikelihood Training ☆310 Updated 4 years ago
- Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models" ☆200 Updated 4 years ago
- A library to conduct ranking experiments with transformers. ☆160 Updated 2 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models". ☆189 Updated 4 years ago
- Adversarial Natural Language Inference Benchmark ☆396 Updated 3 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty… ☆648 Updated 2 years ago
- IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models ☆180 Updated 4 years ago
- PyTorch Implementation of ALBERT (A Lite BERT for Self-supervised Learning of Language Representations) ☆228 Updated 4 years ago
- A repository of concepts related to neural networks for NLP ☆455 Updated last month
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and … ☆317 Updated 5 years ago
- Interpretable Evaluation for AI Systems ☆365 Updated 2 years ago
- Create interactive textual heat maps for Jupyter notebooks ☆196 Updated last year
- A Corpus for Multilingual Document Classification in Eight Languages. ☆152 Updated 3 years ago
- ☆96 Updated 5 years ago