Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
☆50Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for transformers-data-augmentation
Users that are interested in transformers-data-augmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆134Jun 12, 2023Updated 2 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- ☆65May 11, 2022Updated 4 years ago
- ☆20Apr 1, 2022Updated 4 years ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Aug 28, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for text augmentation method leveraging large-scale language models☆62Dec 20, 2021Updated 4 years ago
- A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)☆28May 21, 2021Updated 5 years ago
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings☆57Jan 18, 2023Updated 3 years ago
- Tokenizer 비교 실험☆11Jan 3, 2022Updated 4 years ago
- https://ailabs.enliple.com/☆105Feb 25, 2021Updated 5 years ago
- ☆12Mar 8, 2020Updated 6 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- A chatbot implemented using RNN and GloVe embeddings whch answers your query crazily☆12Jan 1, 2020Updated 6 years ago
- KoBART chatbot☆45Jun 22, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- PyTorch code for meta seq2seq learning☆43Jan 14, 2020Updated 6 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 7 months ago
- EDA를 한국어 데이터에서도 사용할 수 있도록 WordNet을 추가☆102Apr 29, 2020Updated 6 years ago
- Korean Training Data Set Generator for Google Syntanxnet☆13Jun 27, 2017Updated 8 years ago
- 한국어 문서에 노이즈를 추가합니다.☆27Nov 9, 2022Updated 3 years ago
- 문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.☆19Jun 16, 2021Updated 4 years ago
- reference pytorch code for intent classification☆44Oct 18, 2024Updated last year
- Data Augmentation Toolkit for Korean text.☆52Nov 16, 2021Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆358Feb 22, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of "Towards Understanding Mixture of Experts in Deep Learning", NeurIPS 2022☆10Jan 6, 2023Updated 3 years ago
- Kobart model on Huggingface transformers☆64Feb 15, 2022Updated 4 years ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- Tutorial for pretraining Korean GPT-2 model☆67Jun 12, 2023Updated 2 years ago
- ☆13Nov 10, 2021Updated 4 years ago
- Training Transformers of Huggingface with KoNLPy☆68Aug 28, 2020Updated 5 years ago
- 🦛 파이썬 한글 처리 라이브러리. Python Korean Morphological Analyzer☆19Feb 4, 2025Updated last year
- Named Entity Recognition on CoNLL dataset using BiLSTM+CRF implemented with Pytorch☆41Jun 5, 2019Updated 7 years ago
- [NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.☆24Sep 6, 2023Updated 2 years ago
- Selections from EMNLP 2020☆58Jun 4, 2021Updated 5 years ago
- ☆14Sep 29, 2025Updated 8 months ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- Facilitate the learning, practicing, and designing of neural text matching models with a user-friendly and interactive interface.☆42Dec 8, 2022Updated 3 years ago
- GluonNLP tutorial for Pycon2019☆14Aug 16, 2019Updated 6 years ago
- introduces an Arabic Jordanian General Tweets (AJGT) Corpus consisted of 1,800 tweets annotated as positive and negative. Modern Standar…☆12Sep 26, 2023Updated 2 years ago