TextAugment: Text Augmentation Library
☆433 · Mar 4, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for textaugment
Users who are interested in textaugment are comparing it to the libraries listed below.
- Data augmentation for NLP · ☆4,650 · Jun 24, 2024 · Updated last year
- Data augmentation for NLP, presented at EMNLP 2019 · ☆1,651 · Mar 19, 2023 · Updated 2 years ago
- TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs… · ☆3,379 · Jul 10, 2025 · Updated 8 months ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations · ☆786 · May 19, 2024 · Updated last year
- Collection of papers and resources for data augmentation for NLP. · ☆831 · Aug 12, 2022 · Updated 3 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks · ☆926 · Sep 2, 2024 · Updated last year
- Exploring mixup strategies for text classification · ☆31 · Dec 16, 2020 · Updated 5 years ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList · ☆2,050 · Jan 9, 2024 · Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. · ☆157 · May 24, 2024 · Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… · ☆359 · Feb 22, 2022 · Updated 4 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext · ☆142 · Mar 24, 2025 · Updated 11 months ago
- NeuSpell: A Neural Spelling Correction Toolkit · ☆708 · Jul 31, 2023 · Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… · ☆199 · Dec 18, 2022 · Updated 3 years ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve… · ☆4,235 · Aug 25, 2025 · Updated 6 months ago
- Code for EMNLP 2020 paper CoDIR · ☆41 · Oct 4, 2022 · Updated 3 years ago
- Official PyTorch Implementation of SSMix (Findings of ACL 2021) · ☆63 · Jun 16, 2021 · Updated 4 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment · ☆791 · Apr 24, 2023 · Updated 2 years ago
- Unsupervised Data Augmentation (UDA) · ☆2,204 · Aug 28, 2021 · Updated 4 years ago
- Enhancing the BERT training with Semi-supervised Generative Adversarial Networks · ☆229 · Mar 24, 2023 · Updated 2 years ago
- Topic Inference with Zeroshot models · ☆61 · Jun 12, 2023 · Updated 2 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ · ☆85 · Feb 1, 2026 · Updated last month
- Leveraging BERT and c-TF-IDF to create easily interpretable topics. · ☆7,452 · Feb 20, 2026 · Updated 3 weeks ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer… · ☆396 · May 11, 2023 · Updated 2 years ago
- Compute Sentence Embeddings Fast! · ☆625 · Mar 2, 2023 · Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM) · ☆152 · Jan 17, 2023 · Updated 3 years ago
- Contextual augmentation, a text data augmentation using a bidirectional language model. · ☆192 · Jan 3, 2020 · Updated 6 years ago
- State-of-the-Art Text Embeddings · ☆18,390 · Updated this week
- A Visual Analysis Tool to Explore Learned Representations in Transformers Models · ☆604 · Feb 7, 2024 · Updated 2 years ago
- Active Learning for Text Classification in Python · ☆637 · Mar 8, 2026 · Updated last week
- Minimal keyword extraction with BERT · ☆4,123 · Feb 3, 2026 · Updated last month
- Top2Vec learns jointly embedded topic, document and word vectors. · ☆3,109 · Nov 14, 2024 · Updated last year
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface · ☆262 · Aug 21, 2025 · Updated 6 months ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… · ☆1,266 · Jul 24, 2025 · Updated 7 months ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀 · ☆1,687 · Oct 23, 2024 · Updated last year
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code. · ☆1,412 · Aug 30, 2023 · Updated 2 years ago
- Long-context pretrained encoder-decoder models · ☆96 · Oct 28, 2022 · Updated 3 years ago
- EMNLP BlackBox NLP 2020: Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples · ☆26 · Oct 11, 2020 · Updated 5 years ago
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings) · ☆27 · Sep 12, 2021 · Updated 4 years ago
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive… · ☆439 · May 26, 2025 · Updated 9 months ago