Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
☆134 · Jun 12, 2023 · Updated 2 years ago
Alternatives and similar repositories for TransformersDataAugmentation
Users that are interested in TransformersDataAugmentation are comparing it to the libraries listed below.
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper ☆50 · Jun 12, 2023 · Updated 2 years ago
- ☆65 · May 11, 2022 · Updated 4 years ago
- Contextual augmentation, a text data augmentation using a bidirectional language model. ☆192 · Jan 3, 2020 · Updated 6 years ago
- ☆13 · Apr 16, 2021 · Updated 5 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆359 · Feb 22, 2022 · Updated 4 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models. ☆21 · Jul 13, 2022 · Updated 3 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 · Nov 9, 2021 · Updated 4 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper ☆14 · Aug 9, 2021 · Updated 4 years ago
- Code for ACL 2018 paper by Kang et al., "AdvEntuRe: Adversarial Training for Textual Entailment with Knowledge-Guided Examples" ☆17 · Aug 30, 2019 · Updated 6 years ago
- BERT models for many languages created from Wikipedia texts ☆33 · May 25, 2020 · Updated 5 years ago
- Code for text augmentation method leveraging large-scale language models ☆62 · Dec 20, 2021 · Updated 4 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation" ☆10 · Aug 11, 2023 · Updated 2 years ago
- ☆32 · Sep 27, 2021 · Updated 4 years ago
- [EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification ☆130 · Mar 11, 2023 · Updated 3 years ago
- ☆11 · Aug 12, 2020 · Updated 5 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va… ☆12 · Nov 6, 2023 · Updated 2 years ago
- The code for lifelong few-shot language learning ☆55 · Feb 17, 2022 · Updated 4 years ago
- ☆12 · Mar 14, 2022 · Updated 4 years ago
- Simple Conversational Data Augmentation for Semi-supervised Abstractive Conversation Summarization ☆10 · Mar 7, 2022 · Updated 4 years ago
- Data augmentation for NLP, presented at EMNLP 2019 ☆1,652 · Mar 19, 2023 · Updated 3 years ago
- ☆69 · Feb 4, 2021 · Updated 5 years ago
- PathPiece tokenizer ☆14 · Nov 10, 2024 · Updated last year
- Korean Training Data Set Generator for Google SyntaxNet ☆13 · Jun 27, 2017 · Updated 8 years ago
- ☆10 · Oct 15, 2019 · Updated 6 years ago
- Zero-shot Learning by Generating Task-specific Adapters ☆14 · Apr 2, 2021 · Updated 5 years ago
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model" ☆11 · Jun 1, 2021 · Updated 4 years ago
- ☆25 · Oct 28, 2020 · Updated 5 years ago
- A repository summarizing the discussions from beyondBERT (cohort 11.5). ☆57 · Jul 2, 2020 · Updated 5 years ago
- This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch" ☆40 · Jun 9, 2023 · Updated 2 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021 ☆33 · Jun 9, 2021 · Updated 4 years ago
- Repository for Repurposing Entailment for Multi-Hop Question Answering Tasks, NAACL19 ☆29 · May 4, 2020 · Updated 6 years ago
- Combining encoder-based language models ☆11 · Nov 11, 2021 · Updated 4 years ago
- UDA (Unsupervised Data Augmentation) implemented in PyTorch ☆278 · Dec 13, 2019 · Updated 6 years ago
- Data augmentation for NLP, accepted at EMNLP 2021 Findings ☆106 · Nov 30, 2023 · Updated 2 years ago
- Synthetic dataset for recommender system created from Naver Movie rating system ☆26 · Dec 8, 2023 · Updated 2 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/ ☆100 · Nov 11, 2022 · Updated 3 years ago
- Paper: Relational Sentence Embedding for Flexible Semantic Matching ☆12 · May 22, 2024 · Updated 2 years ago
- [NAACL 2024] A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining ☆18 · Nov 26, 2023 · Updated 2 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data" ☆14 · Feb 16, 2021 · Updated 5 years ago