Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
☆135Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for TransformersDataAugmentation
Users that are interested in TransformersDataAugmentation are comparing it to the libraries listed below
Sorting:
- ☆65May 11, 2022Updated 3 years ago
- ☆13Apr 16, 2021Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Aug 9, 2021Updated 4 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- ☆11Aug 12, 2020Updated 5 years ago
- Contextual augmentation, a text data augmentation using a bidirectional language model.☆192Jan 3, 2020Updated 6 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- ☆32Sep 27, 2021Updated 4 years ago
- ☆69Feb 4, 2021Updated 5 years ago
- Synthetic dataset for recommender system created from Naver Movie rating system☆26Dec 8, 2023Updated 2 years ago
- PathPiece tokenizer☆13Nov 10, 2024Updated last year
- The code for lifelong few-shot language learning☆55Feb 17, 2022Updated 4 years ago
- Korean Training Data Set Generator for Google Syntanxnet☆13Jun 27, 2017Updated 8 years ago
- Zero-shot Learning by Generating Task-specific Adapters☆14Apr 2, 2021Updated 4 years ago
- 11.5기의 beyondBERT의 토론 내용을 정리하는 repository입니다.☆57Jul 2, 2020Updated 5 years ago
- Code for text augmentation method leveraging large-scale language models☆61Dec 20, 2021Updated 4 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- [EACL'21] Non-Autoregressive with Pretrained Language Model☆61Oct 10, 2022Updated 3 years ago
- Data augmentation for NLP☆4,645Jun 24, 2024Updated last year
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021☆36May 8, 2021Updated 4 years ago
- Data augmentation for NLP, presented at EMNLP 2019☆1,650Mar 19, 2023Updated 2 years ago
- Cross-lingual GLUE☆49Jun 15, 2023Updated 2 years ago
- BERT-related papers☆2,040Aug 12, 2023Updated 2 years ago
- WebConf 2020 paper Leading Conversational Search by Suggesting Useful Questions☆33May 4, 2020Updated 5 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Jun 9, 2021Updated 4 years ago
- Accelerated Reinforcement Learning for Sentence Generation by Vocabulary Prediction☆70Jun 12, 2020Updated 5 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Dec 6, 2024Updated last year
- BERT models for many languages created from Wikipedia texts☆33May 25, 2020Updated 5 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆101Nov 11, 2022Updated 3 years ago
- Training Transformers of Huggingface with KoNLPy☆68Aug 28, 2020Updated 5 years ago
- MULTI GPU환경에서 ETRI 한국어 BERT모델 활용한 Korquad 학습 방법☆29Mar 16, 2020Updated 5 years ago
- Korean text data preprocess toolkit for NLP☆18Jun 11, 2019Updated 6 years ago
- https://challenge.enliple.com/☆16Jun 10, 2020Updated 5 years ago
- DeLighT: Very Deep and Light-Weight Transformers☆469Oct 16, 2020Updated 5 years ago
- Zero-shot Entity Linking with blitz start in 3 minutes. Hard negative mining and encoder for all entities are also included in this imple…☆32Jun 12, 2023Updated 2 years ago
- SUM-QE, a BERT-based Summary Quality Estimation Model☆21Jul 22, 2023Updated 2 years ago