Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
☆135Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for TransformersDataAugmentation
Users that are interested in TransformersDataAugmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆51Jun 12, 2023Updated 2 years ago
- ☆65May 11, 2022Updated 3 years ago
- Contextual augmentation, a text data augmentation using a bidirectional language model.☆192Jan 3, 2020Updated 6 years ago
- ☆13Apr 16, 2021Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Aug 9, 2021Updated 4 years ago
- code for ACL 2018 paper by Kang et al., "AdvEntuRe: Adversarial Training for Textual Entailment with Knowledge-Guided Examples "☆17Aug 30, 2019Updated 6 years ago
- BERT models for many languages created from Wikipedia texts☆33May 25, 2020Updated 5 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- ☆32Sep 27, 2021Updated 4 years ago
- [EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification☆130Mar 11, 2023Updated 3 years ago
- ☆11Aug 12, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Data augmentation for NLP☆4,656Jun 24, 2024Updated last year
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- The code for lifelong few-shot language learning☆55Feb 17, 2022Updated 4 years ago
- ☆12Mar 14, 2022Updated 4 years ago
- Simple Conversational Data Augmentation for Semi-supervised Abstractive Conversation Summarization☆10Mar 7, 2022Updated 4 years ago
- Data augmentation for NLP, presented at EMNLP 2019☆1,652Mar 19, 2023Updated 3 years ago
- ☆69Feb 4, 2021Updated 5 years ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- Korean Training Data Set Generator for Google Syntanxnet☆13Jun 27, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆10Oct 15, 2019Updated 6 years ago
- Zero-shot Learning by Generating Task-specific Adapters☆14Apr 2, 2021Updated 5 years ago
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 4 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- 11.5기의 beyondBERT의 토론 내용을 정리하는 repository입니다.☆57Jul 2, 2020Updated 5 years ago
- This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"☆40Jun 9, 2023Updated 2 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Jun 9, 2021Updated 4 years ago
- Repository for Repurposing Entailment for Multi-Hop Question Answering Tasks, NAACL19☆29May 4, 2020Updated 5 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- UDA(Unsupervised Data Augmentation) implemented by pytorch☆277Dec 13, 2019Updated 6 years ago
- Data augmentation for NLP, accepted at EMNLP 2021 Findings☆106Nov 30, 2023Updated 2 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆100Nov 11, 2022Updated 3 years ago
- Synthetic dataset for recommender system created from Naver Movie rating system☆26Dec 8, 2023Updated 2 years ago
- Paper: Relational Sentence Embedding for Flexible Semantic Matching☆12May 22, 2024Updated last year
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Feb 16, 2021Updated 5 years ago