dsfsi / textaugment
TextAugment: Text Augmentation Library
☆415Updated last year
Alternatives and similar repositories for textaugment:
Users that are interested in textaugment are comparing it to the libraries listed below
- Collection of papers and resources for data augmentation for NLP.☆829Updated 2 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆381Updated last year
- Officially supported AllenNLP models☆539Updated 2 years ago
- PyTorch deep learning models for document classification☆594Updated last year
- Autoregressive Entity Retrieval☆781Updated last year
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆200Updated last year
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,115Updated 6 months ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆362Updated 3 years ago
- Minimalist implementation of a BERT Sentence Classifier with PyTorch Lightning, Transformers and PyTorch-NLP.☆216Updated last year
- Code for using and evaluating SpanBERT.☆895Updated last year
- Data augmentation for NLP, presented at EMNLP 2019☆1,622Updated last year
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆780Updated 9 months ago
- ☆344Updated 3 years ago
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT☆229Updated 4 years ago
- Compute Sentence Embeddings Fast!☆621Updated 2 years ago
- Plot the vector graph of attention based text visualisation☆372Updated 5 years ago
- Awesome Neural Adaptation in Natural Language Processing. A curated list. https://arxiv.org/abs/2006.00632☆265Updated 3 years ago
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive…☆429Updated last year
- Semantics-aware BERT for Language Understanding (AAAI 2020)☆287Updated 2 years ago
- Repository for TweetEval☆365Updated 2 years ago
- Code for ACL 2020 paper: "Extractive Summarization as Text Matching"☆520Updated 3 years ago
- A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainabl…☆340Updated 2 months ago
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆530Updated 3 years ago
- Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"☆393Updated last year
- Multimodal model for text and tabular data with HuggingFace transformers as building block for text data☆602Updated 4 months ago
- Enhancing the BERT training with Semi-supervised Generative Adversarial Networks☆227Updated last year
- An elaborate and exhaustive paper list for Named Entity Recognition (NER)☆394Updated 3 years ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆556Updated 3 years ago
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723☆726Updated 2 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆586Updated 7 months ago