TextAugment: Text Augmentation Library
☆432Dec 10, 2025Updated 2 months ago
Alternatives and similar repositories for textaugment
Users that are interested in textaugment are comparing it to the libraries listed below
Sorting:
- Data augmentation for NLP☆4,644Jun 24, 2024Updated last year
- Data augmentation for NLP, presented at EMNLP 2019☆1,650Mar 19, 2023Updated 2 years ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs…☆3,364Jul 10, 2025Updated 7 months ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆786May 19, 2024Updated last year
- Collection of papers and resources for data augmentation for NLP.☆831Aug 12, 2022Updated 3 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- Exploring mixup strategies for text classification☆32Dec 16, 2020Updated 5 years ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,048Jan 9, 2024Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆157May 24, 2024Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆142Mar 24, 2025Updated 11 months ago
- NeuSpell: A Neural Spelling Correction Toolkit☆706Jul 31, 2023Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆199Dec 18, 2022Updated 3 years ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve…☆4,231Aug 25, 2025Updated 6 months ago
- Code for EMNLP 2020 paper CoDIR☆41Oct 4, 2022Updated 3 years ago
- Official PyTorch Implementation of SSMix (Findings of ACL 2021)☆63Jun 16, 2021Updated 4 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆791Apr 24, 2023Updated 2 years ago
- Unsupervised Data Augmentation (UDA)☆2,204Aug 28, 2021Updated 4 years ago
- Enhancing the BERT training with Semi-supervised Generative Adversarial Networks☆229Mar 24, 2023Updated 2 years ago
- Topic Inference with Zeroshot models☆61Jun 12, 2023Updated 2 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Feb 1, 2026Updated 3 weeks ago
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆7,412Feb 20, 2026Updated last week
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 2 years ago
- Compute Sentence Embeddings Fast!☆624Mar 2, 2023Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Jan 17, 2023Updated 3 years ago
- Contextual augmentation, a text data augmentation using a bidirectional language model.☆192Jan 3, 2020Updated 6 years ago
- State-of-the-Art Text Embeddings☆18,298Feb 20, 2026Updated last week
- A Visual Analysis Tool to Explore Learned Representations in Transformers Models☆603Feb 7, 2024Updated 2 years ago
- Active Learning for Text Classification in Python☆639Feb 1, 2026Updated 3 weeks ago
- Minimal keyword extraction with BERT☆4,116Feb 3, 2026Updated 3 weeks ago
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,106Nov 14, 2024Updated last year
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Aug 21, 2025Updated 6 months ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,265Jul 24, 2025Updated 7 months ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,687Oct 23, 2024Updated last year
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.☆1,410Aug 30, 2023Updated 2 years ago
- Long-context pretrained encoder-decoder models☆96Oct 28, 2022Updated 3 years ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the…☆2,081Aug 15, 2024Updated last year
- EMNLP BlackBox NLP 2020: Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples☆26Oct 11, 2020Updated 5 years ago
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)☆26Sep 12, 2021Updated 4 years ago