TextAugment: Text Augmentation Library
☆436Mar 4, 2026Updated last month
Alternatives and similar repositories for textaugment
Users that are interested in textaugment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data augmentation for NLP☆4,654Jun 24, 2024Updated last year
- Data augmentation for NLP, presented at EMNLP 2019☆1,652Mar 19, 2023Updated 3 years ago
- Collection of papers and resources for data augmentation for NLP.☆833Aug 12, 2022Updated 3 years ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…☆3,398Jul 10, 2025Updated 8 months ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆787May 19, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Exploring mixup strategies for text classification☆31Dec 16, 2020Updated 5 years ago
- Official PyTorch Implementation of SSMix (Findings of ACL 2021)☆63Jun 16, 2021Updated 4 years ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,051Jan 9, 2024Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆157May 24, 2024Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks☆927Sep 2, 2024Updated last year
- Contextual augmentation, a text data augmentation using a bidirectional language model.☆192Jan 3, 2020Updated 6 years ago
- The impletation of paper titled GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis☆21Nov 23, 2022Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Unsupervised Data Augmentation (UDA)☆2,205Aug 28, 2021Updated 4 years ago
- MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification☆356Jun 5, 2020Updated 5 years ago
- Active Learning for Text Classification in Python☆637Apr 1, 2026Updated last week
- NeuSpell: A Neural Spelling Correction Toolkit☆712Jul 31, 2023Updated 2 years ago
- ☆65May 11, 2022Updated 3 years ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…☆4,239Aug 25, 2025Updated 7 months ago
- Code related to experimentation of different Text Data Augmentation Techniques☆14Oct 24, 2019Updated 6 years ago
- Code for EMNLP 2020 paper CoDIR☆41Oct 4, 2022Updated 3 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆142Mar 24, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Enhancing the BERT training with Semi-supervised Generative Adversarial Networks☆229Mar 24, 2023Updated 3 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆367Jun 27, 2022Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆135Jun 12, 2023Updated 2 years ago
- Data augmentation for NLP, accepted at EMNLP 2021 Findings☆106Nov 30, 2023Updated 2 years ago
- Minimal keyword extraction with BERT☆4,141Feb 3, 2026Updated 2 months ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆262Aug 21, 2025Updated 7 months ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆789Apr 24, 2023Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆199Dec 18, 2022Updated 3 years ago
- State-of-the-Art Text Embeddings☆18,494Apr 2, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆7,508Feb 20, 2026Updated last month
- Long-context pretrained encoder-decoder models☆96Oct 28, 2022Updated 3 years ago
- ☆62Apr 19, 2022Updated 3 years ago
- Compute Sentence Embeddings Fast!☆625Mar 2, 2023Updated 3 years ago
- A Visual Analysis Tool to Explore Learned Representations in Transformers Models☆603Feb 7, 2024Updated 2 years ago
- Topic Inference with Zeroshot models☆61Jun 12, 2023Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Jan 17, 2023Updated 3 years ago