TextAugment: Text Augmentation Library
☆439Mar 4, 2026Updated 3 months ago
Alternatives and similar repositories for textaugment
Users that are interested in textaugment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data augmentation for NLP☆4,657Updated this week
- Data augmentation for NLP, presented at EMNLP 2019☆1,652Mar 19, 2023Updated 3 years ago
- Collection of papers and resources for data augmentation for NLP.☆832Aug 12, 2022Updated 3 years ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…☆3,429Apr 17, 2026Updated last month
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆787May 19, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Exploring mixup strategies for text classification☆31Dec 16, 2020Updated 5 years ago
- Official PyTorch Implementation of SSMix (Findings of ACL 2021)☆63Jun 16, 2021Updated 4 years ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,048Jan 9, 2024Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156May 24, 2024Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- Contextual augmentation, a text data augmentation using a bidirectional language model.☆192Jan 3, 2020Updated 6 years ago
- The impletation of paper titled GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis☆21Nov 23, 2022Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆358Feb 22, 2022Updated 4 years ago
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Unsupervised Data Augmentation (UDA)☆2,206Aug 28, 2021Updated 4 years ago
- MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification☆356Jun 5, 2020Updated 6 years ago
- Active Learning for Text Classification in Python☆643May 24, 2026Updated 2 weeks ago
- NeuSpell: A Neural Spelling Correction Toolkit☆712Jul 31, 2023Updated 2 years ago
- ☆65May 11, 2022Updated 4 years ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…☆4,248May 31, 2026Updated last week
- Code related to experimentation of different Text Data Augmentation Techniques☆14Oct 24, 2019Updated 6 years ago
- Code for EMNLP 2020 paper CoDIR☆41Oct 4, 2022Updated 3 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆143Mar 24, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Enhancing the BERT training with Semi-supervised Generative Adversarial Networks☆229Mar 24, 2023Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆134Jun 12, 2023Updated 2 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆368Jun 27, 2022Updated 3 years ago
- Data augmentation for NLP, accepted at EMNLP 2021 Findings☆106Nov 30, 2023Updated 2 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Aug 21, 2025Updated 9 months ago
- Minimal keyword extraction with BERT☆4,185May 13, 2026Updated 3 weeks ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆788Apr 24, 2023Updated 3 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆198Dec 18, 2022Updated 3 years ago
- State-of-the-Art Embeddings, Retrieval, and Reranking☆18,780Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆7,659May 13, 2026Updated 3 weeks ago
- Long-context pretrained encoder-decoder models☆96Oct 28, 2022Updated 3 years ago
- ☆62Apr 19, 2022Updated 4 years ago
- Compute Sentence Embeddings Fast!☆624Mar 2, 2023Updated 3 years ago
- A Visual Analysis Tool to Explore Learned Representations in Transformers Models☆607Feb 7, 2024Updated 2 years ago
- Topic Inference with Zeroshot models☆61Jun 12, 2023Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Jan 17, 2023Updated 3 years ago