Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
☆135Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for TransformersDataAugmentation
Users that are interested in TransformersDataAugmentation are comparing it to the libraries listed below
Sorting:
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆51Jun 12, 2023Updated 2 years ago
- ☆65May 11, 2022Updated 3 years ago
- Contextual augmentation, a text data augmentation using a bidirectional language model.☆192Jan 3, 2020Updated 6 years ago
- ☆13Apr 16, 2021Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Aug 9, 2021Updated 4 years ago
- code for ACL 2018 paper by Kang et al., "AdvEntuRe: Adversarial Training for Textual Entailment with Knowledge-Guided Examples "☆17Aug 30, 2019Updated 6 years ago
- Code for text augmentation method leveraging large-scale language models☆61Dec 20, 2021Updated 4 years ago
- BERT models for many languages created from Wikipedia texts☆33May 25, 2020Updated 5 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- ☆32Sep 27, 2021Updated 4 years ago
- [EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification☆130Mar 11, 2023Updated 3 years ago
- ☆11Aug 12, 2020Updated 5 years ago
- Data augmentation for NLP☆4,652Jun 24, 2024Updated last year
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- The code for lifelong few-shot language learning☆55Feb 17, 2022Updated 4 years ago
- ☆12Mar 14, 2022Updated 4 years ago
- Simple Conversational Data Augmentation for Semi-supervised Abstractive Conversation Summarization☆10Mar 7, 2022Updated 4 years ago
- Data augmentation for NLP, presented at EMNLP 2019☆1,651Mar 19, 2023Updated 3 years ago
- ☆69Feb 4, 2021Updated 5 years ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- Korean Training Data Set Generator for Google Syntanxnet☆13Jun 27, 2017Updated 8 years ago
- Zero-shot Learning by Generating Task-specific Adapters☆14Apr 2, 2021Updated 4 years ago
- ☆10Oct 15, 2019Updated 6 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- 11.5기의 beyondBERT의 토론 내용을 정리하는 repository입니다.☆57Jul 2, 2020Updated 5 years ago
- This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"☆40Jun 9, 2023Updated 2 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Jun 9, 2021Updated 4 years ago
- Repository for Repurposing Entailment for Multi-Hop Question Answering Tasks, NAACL19☆29May 4, 2020Updated 5 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- UDA(Unsupervised Data Augmentation) implemented by pytorch☆278Dec 13, 2019Updated 6 years ago
- Data augmentation for NLP, accepted at EMNLP 2021 Findings☆106Nov 30, 2023Updated 2 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆100Nov 11, 2022Updated 3 years ago
- Synthetic dataset for recommender system created from Naver Movie rating system☆26Dec 8, 2023Updated 2 years ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Feb 16, 2021Updated 5 years ago
- NAACL 2021 - Progressive Generation of Long Text☆82Oct 2, 2020Updated 5 years ago