amazon-science/transformers-data-augmentation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/amazon-science/transformers-data-augmentation)

amazon-science / transformers-data-augmentation

Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper

☆50

Alternatives and similar repositories for transformers-data-augmentation

Users that are interested in transformers-data-augmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

varunentropy / TransformersDataAugmentation
View on GitHub
Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
☆134Jun 12, 2023Updated 3 years ago
seujung / t5-summarization
View on GitHub
☆25Oct 28, 2020Updated 5 years ago
1024er / cbert_aug
View on GitHub
☆65May 11, 2022Updated 4 years ago
bbuing9 / DND
View on GitHub
Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)
☆12Aug 28, 2023Updated 2 years ago
naver-ai / hypermix
View on GitHub
Code for text augmentation method leveraging large-scale language models
☆63Dec 20, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jeongukjae / python-mecab
View on GitHub
A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
☆28May 21, 2021Updated 5 years ago
Beomi / exbert-transformers
View on GitHub
exBERT on Transformers🤗
☆10Jun 14, 2021Updated 5 years ago
ART-Group-it / KERMIT
View on GitHub
🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings
☆57Jan 18, 2023Updated 3 years ago
Sunkyoung / Compare-tokenizer
View on GitHub
Tokenizer 비교 실험
☆11Jan 3, 2022Updated 4 years ago
hanjanghoon / NLP_Koeran_DP
View on GitHub
2019 국어경진대회 한국어 의존구문 분석 대상(문체부 장관상)
☆16Oct 26, 2022Updated 3 years ago
kh-kim / deeplearning_with_pytorch
View on GitHub
☆12Mar 8, 2020Updated 6 years ago
lukasgarbas / can-we-tune-together
View on GitHub
Combining encoder-based language models
☆11Nov 11, 2021Updated 4 years ago
kushagra2101 / ChatCrazie
View on GitHub
A chatbot implemented using RNN and GloVe embeddings whch answers your query crazily
☆12Jan 1, 2020Updated 6 years ago
brendenlake / meta_seq2seq
View on GitHub
PyTorch code for meta seq2seq learning
☆44Jan 14, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
monologg / ko_lm_dataformat
View on GitHub
A utility for storing and reading files for Korean LM training 💾
☆35Updated this week
haven-jeon / KoBART-chatbot
View on GitHub
KoBART chatbot
☆45Jun 22, 2021Updated 5 years ago
catSirup / KorEDA
View on GitHub
EDA를 한국어 데이터에서도 사용할 수 있도록 WordNet을 추가
☆102Apr 29, 2020Updated 6 years ago
hoondongkim / syntaxnet-kr
View on GitHub
Korean Training Data Set Generator for Google Syntanxnet
☆13Jun 27, 2017Updated 9 years ago
jucho2725 / ktextaug
View on GitHub
Data Augmentation Toolkit for Korean text.
☆52Nov 16, 2021Updated 4 years ago
hkjeon13 / noising-korean
View on GitHub
한국어 문서에 노이즈를 추가합니다.
☆27Nov 9, 2022Updated 3 years ago
dsindex / iclassifier
View on GitHub
reference pytorch code for intent classification
☆44Oct 18, 2024Updated last year
hyunwoongko / kobart-transformers
View on GitHub
Kobart model on Huggingface transformers
☆64Feb 15, 2022Updated 4 years ago
ros-infrastructure / rosdoc_lite
View on GitHub
A light-weight version of rosdoc that does not rely on ROS infrastructure for crawling packages.
☆10Apr 16, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jeongukjae / namuwiki-corpus
View on GitHub
문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.
☆19Jun 16, 2021Updated 5 years ago
facebookresearch / SentAugment
View on GitHub
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359Feb 22, 2022Updated 4 years ago
taeminlee / KoGPT2-Transformers
View on GitHub
KoGPT2 on Huggingface Transformers
☆33May 4, 2021Updated 5 years ago
zzh-SJTU / CRT-QA
View on GitHub
The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…
☆13May 19, 2025Updated last year
MrBananaHuman / KorGPT2Tutorial
View on GitHub
Tutorial for pretraining Korean GPT-2 model
☆67Jun 12, 2023Updated 3 years ago
kensho-technologies / pathpiece
View on GitHub
PathPiece tokenizer
☆14Nov 10, 2024Updated last year
JianxinMa / clrec_v1.0
View on GitHub
☆13Nov 10, 2021Updated 4 years ago
jeongukjae / korean-wikipedia-corpus
View on GitHub
문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.
☆24Sep 6, 2023Updated 2 years ago
Gubuzeong / Getting-Started-with-Google-BERT
View on GitHub
☆15Mar 28, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
danieldeutsch / summarize
View on GitHub
☆12Nov 11, 2019Updated 6 years ago
juand-r / EMNLP-2020
View on GitHub
Selections from EMNLP 2020
☆58Jun 4, 2021Updated 5 years ago
keep-steady / NER_pytorch
View on GitHub
Named Entity Recognition on CoNLL dataset using BiLSTM+CRF implemented with Pytorch
☆41Jun 5, 2019Updated 7 years ago
NTMC-Community / MatchZoo-Studio
View on GitHub
Facilitate the learning, practicing, and designing of neural text matching models with a user-friendly and interactive interface.
☆42Dec 8, 2022Updated 3 years ago
nawnoes / korean-wellness-chatbot-models
View on GitHub
Korean wellness chatbot models: KoGPT2 + KoBERT/KoELECTRA (PyTorch, Transformers).
☆209Jan 12, 2026Updated 6 months ago
isle-dev / MetricEval
View on GitHub
MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…
☆12Nov 6, 2023Updated 2 years ago
seujung / gluonnlp_tutorial
View on GitHub
GluonNLP tutorial for Pycon2019
☆14Aug 16, 2019Updated 6 years ago