makcedward / nlpaugLinks

Data augmentation for NLP

☆4,594

Alternatives and similar repositories for nlpaug

Users that are interested in nlpaug are comparing it to the libraries listed below

Sorting:

jasonwei20 / eda_nlp
Data augmentation for NLP, presented at EMNLP 2019
☆1,648Updated 2 years ago
QData / TextAttack
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…
☆3,230Updated 3 weeks ago
ThilinaRajapakse / simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…
☆4,201Updated 3 months ago
marcotcr / checklist
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
☆2,038Updated last year
appvision-ai / fast-bert
Super easy library for BERT based NLP models
☆1,899Updated 11 months ago
google-research / electra
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆2,358Updated last year
facebookresearch / XLM
PyTorch original implementation of Cross-lingual Language Model Pretraining.
☆2,916Updated 2 years ago
tomohideshibata / BERT-related-papers
BERT-related papers
☆2,047Updated last year
allenai / longformer
Longformer: The Long-Document Transformer
☆2,151Updated 2 years ago
nyu-mll / jiant
jiant is an nlp toolkit
☆1,670Updated 2 years ago
princeton-nlp / SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
☆3,576Updated 9 months ago
Separius / awesome-sentence-embedding
A curated list of pretrained sentence and word embedding models
☆2,266Updated 4 years ago
facebookresearch / SentEval
A python tool for evaluating the quality of sentence embeddings.
☆2,108Updated last year
chakki-works / seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
☆1,147Updated 11 months ago
thunlp / PLMpapers
Must-read Papers on pre-trained language models.
☆3,357Updated 2 years ago
abhimishra91 / transformers-tutorials
Github repo with tutorials to fine tune transformers for diff NLP tasks
☆857Updated last year
namisan / mt-dnn
Multi-Task Deep Neural Networks for Natural Language Understanding
☆2,253Updated last year
styfeng / DataAug4NLP
Collection of papers and resources for data augmentation for NLP.
☆828Updated 2 years ago
Tiiiger / bert_score
BERT score for text generation
☆1,781Updated last year
deepset-ai / FARM
Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
☆1,754Updated last year
facebookresearch / LASER
Language-Agnostic SEntence Representations
☆3,648Updated last year
jessevig / bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
☆7,580Updated 2 months ago
microsoft / DeBERTa
The implementation of DeBERTa
☆2,123Updated last year
juand-r / entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of lang…
☆1,546Updated last month
google-research / uda
Unsupervised Data Augmentation (UDA)
☆2,194Updated 3 years ago
nlpyang / BertSum
Code for paper Fine-tune BERT for Extractive Summarization
☆1,489Updated 3 years ago
google-research / text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,401Updated 3 months ago
google-research / language
Shared repository for open-sourced projects from the Google AI Language team.
☆1,693Updated 3 weeks ago
bheinzerling / bpemb
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
☆1,216Updated 10 months ago
google-research / albert
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
☆3,273Updated 2 years ago