Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
☆50Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for transformers-data-augmentation
Users that are interested in transformers-data-augmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆134Jun 12, 2023Updated 2 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- ☆65May 11, 2022Updated 4 years ago
- ☆20Apr 1, 2022Updated 4 years ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Aug 28, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for text augmentation method leveraging large-scale language models☆62Dec 20, 2021Updated 4 years ago
- A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)☆28May 21, 2021Updated 5 years ago
- exBERT on Transformers🤗☆10Jun 14, 2021Updated 4 years ago
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings☆57Jan 18, 2023Updated 3 years ago
- Tokenizer 비교 실험☆11Jan 3, 2022Updated 4 years ago
- 2019 국어경진대회 한국어 의존구문 분석 대상(문체부 장관상)☆15Oct 26, 2022Updated 3 years ago
- https://ailabs.enliple.com/☆105Feb 25, 2021Updated 5 years ago
- ☆12Mar 8, 2020Updated 6 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A chatbot implemented using RNN and GloVe embeddings whch answers your query crazily☆12Jan 1, 2020Updated 6 years ago
- KoBART chatbot☆45Jun 22, 2021Updated 4 years ago
- PyTorch code for meta seq2seq learning☆43Jan 14, 2020Updated 6 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 7 months ago
- EDA를 한국어 데이터에서도 사용할 수 있도록 WordNet을 추가☆102Apr 29, 2020Updated 6 years ago
- 한국어 문서에 노이즈를 추가합니다.☆27Nov 9, 2022Updated 3 years ago
- 문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.☆19Jun 16, 2021Updated 4 years ago
- reference pytorch code for intent classification☆44Oct 18, 2024Updated last year
- Data Augmentation Toolkit for Korean text.☆52Nov 16, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Implementation of "Towards Understanding Mixture of Experts in Deep Learning", NeurIPS 2022☆10Jan 6, 2023Updated 3 years ago
- KoGPT2 on Huggingface Transformers☆33May 4, 2021Updated 5 years ago
- Kobart model on Huggingface transformers☆64Feb 15, 2022Updated 4 years ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…☆13May 19, 2025Updated last year
- Tutorial for pretraining Korean GPT-2 model☆67Jun 12, 2023Updated 2 years ago
- ☆13Nov 10, 2021Updated 4 years ago
- Training Transformers of Huggingface with KoNLPy☆68Aug 28, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 🦛 파이썬 한글 처리 라이브러리. Python Korean Morphological Analyzer☆19Feb 4, 2025Updated last year
- Named Entity Recognition on CoNLL dataset using BiLSTM+CRF implemented with Pytorch☆41Jun 5, 2019Updated 6 years ago
- [NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- Selections from EMNLP 2020☆58Jun 4, 2021Updated 4 years ago
- ☆14Sep 29, 2025Updated 7 months ago
- A light-weight version of rosdoc that does not rely on ROS infrastructure for crawling packages.☆10Apr 16, 2024Updated 2 years ago
- Facilitate the learning, practicing, and designing of neural text matching models with a user-friendly and interactive interface.☆42Dec 8, 2022Updated 3 years ago