varunkumar-dev/TransformersDataAugmentation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/varunkumar-dev/TransformersDataAugmentation)

varunkumar-dev / TransformersDataAugmentation

Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper

☆134

Alternatives and similar repositories for TransformersDataAugmentation

Users that are interested in TransformersDataAugmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amazon-science / transformers-data-augmentation
View on GitHub
Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
☆50Jun 12, 2023Updated 2 years ago
1024er / cbert_aug
View on GitHub
☆65May 11, 2022Updated 4 years ago
pfnet-research / contextual_augmentation
View on GitHub
Contextual augmentation, a text data augmentation using a bidirectional language model.
☆192Jan 3, 2020Updated 6 years ago
pdufter / staticlama
View on GitHub
☆13Apr 16, 2021Updated 5 years ago
facebookresearch / SentAugment
View on GitHub
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359Feb 22, 2022Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
mlpc-ucsd / BERT_Convolutions
View on GitHub
(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.
☆21Jul 13, 2022Updated 3 years ago
LuisaMaerz / KnowMAN
View on GitHub
KnowMAN: Weakly Supervised Multinomial Adversarial Networks
☆12Nov 9, 2021Updated 4 years ago
AI21Labs / pmi-masking
View on GitHub
This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper
☆14Aug 9, 2021Updated 4 years ago
dykang / adventure
View on GitHub
code for ACL 2018 paper by Kang et al., "AdvEntuRe: Adversarial Training for Textual Entailment with Knowledge-Guided Examples "
☆17Aug 30, 2019Updated 6 years ago
TurkuNLP / wikibert
View on GitHub
BERT models for many languages created from Wikipedia texts
☆33May 25, 2020Updated 5 years ago
naver-ai / hypermix
View on GitHub
Code for text augmentation method leveraging large-scale language models
☆62Dec 20, 2021Updated 4 years ago
ShaojieJiang / tldr
View on GitHub
Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"
☆10Aug 11, 2023Updated 2 years ago
namisan / exdeep-nmt
View on GitHub
☆32Sep 27, 2021Updated 4 years ago
lancopku / text-autoaugment
View on GitHub
[EMNLP 2021] Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification
☆130Mar 11, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
ModuNLP / hacking_transformers
View on GitHub
☆11Aug 12, 2020Updated 5 years ago
isle-dev / MetricEval
View on GitHub
MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…
☆12Nov 6, 2023Updated 2 years ago
qcwthu / Lifelong-Fewshot-Language-Learning
View on GitHub
The code for lifelong few-shot language learning
☆55Feb 17, 2022Updated 4 years ago
MatNLP / HiCLRE
View on GitHub
☆12Mar 14, 2022Updated 4 years ago
SALT-NLP / CODA
View on GitHub
Simple Conversational Data Augmentation for Semi-supervised Abstractive Conversation Summarization
☆10Mar 7, 2022Updated 4 years ago
jasonwei20 / eda_nlp
View on GitHub
Data augmentation for NLP, presented at EMNLP 2019
☆1,652Mar 19, 2023Updated 3 years ago
yuanbit / jina-financial-qa-search
View on GitHub
☆69Feb 4, 2021Updated 5 years ago
kensho-technologies / pathpiece
View on GitHub
PathPiece tokenizer
☆14Nov 10, 2024Updated last year
hoondongkim / syntaxnet-kr
View on GitHub
Korean Training Data Set Generator for Google Syntanxnet
☆13Jun 27, 2017Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
TurkuNLP / bert-eval
View on GitHub
☆10Oct 15, 2019Updated 6 years ago
INK-USC / hypter
View on GitHub
Zero-shot Learning by Generating Task-specific Adapters
☆14Apr 2, 2021Updated 5 years ago
Hi-ZenanXu / Syntax-Enhanced_Pre-trained_Model
View on GitHub
Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"
☆11Jun 1, 2021Updated 4 years ago
seujung / t5-summarization
View on GitHub
☆25Oct 28, 2020Updated 5 years ago
modulabs / beyondBERT
View on GitHub
11.5기의 beyondBERT의 토론 내용을 정리하는 repository입니다.
☆57Jul 2, 2020Updated 5 years ago
hkust-nlp / SynCSE
View on GitHub
This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"
☆40Jun 9, 2023Updated 2 years ago
UriSha / EmbeddinglessNMT
View on GitHub
The implementation of "Neural Machine Translation without Embeddings", NAACL 2021
☆33Jun 9, 2021Updated 4 years ago
StonyBrookNLP / multee
View on GitHub
Repository for Repurposing Entailment for Multi-Hop Question Answering Tasks, NAACL19
☆29May 4, 2020Updated 6 years ago
lukasgarbas / can-we-tune-together
View on GitHub
Combining encoder-based language models
☆11Nov 11, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SanghunYun / UDA_pytorch
View on GitHub
UDA(Unsupervised Data Augmentation) implemented by pytorch
☆278Dec 13, 2019Updated 6 years ago
akkarimi / aeda_nlp
View on GitHub
Data augmentation for NLP, accepted at EMNLP 2021 Findings
☆106Nov 30, 2023Updated 2 years ago
lovit / kmrd
View on GitHub
Synthetic dataset for recommender system created from Naver Movie rating system
☆26Dec 8, 2023Updated 2 years ago
jayded / eraserbenchmark
View on GitHub
A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/
☆100Nov 11, 2022Updated 3 years ago
BinWang28 / RSE
View on GitHub
Paper: Relational Sentence Embedding for Flexible Semantic Matching
☆12May 22, 2024Updated 2 years ago
cisnlp / ofa
View on GitHub
[NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining
☆18Nov 26, 2023Updated 2 years ago
peterbhase / ExplanationRoles
View on GitHub
Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"
☆14Feb 16, 2021Updated 5 years ago