Augmenty is an augmentation library based on spaCy for augmenting texts.
☆157May 24, 2024Updated last year
Alternatives and similar repositories for augmenty
Users that are interested in augmenty are comparing it to the libraries listed below
Sorting:
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Apr 15, 2024Updated last year
- A Python library for calculating a large variety of metrics from text☆360Jan 30, 2026Updated last month
- A spaCy custom component that extracts and normalizes temporal expressions☆56Feb 13, 2023Updated 3 years ago
- ☆68Mar 17, 2022Updated 3 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆120Oct 20, 2025Updated 4 months ago
- Implementation of the ClausIE information extraction system for python+spacy☆226Aug 8, 2022Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Jun 19, 2023Updated 2 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆403Jul 30, 2021Updated 4 years ago
- Confection: the sweetest config system for Python☆193Feb 9, 2026Updated 3 weeks ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆220Jan 20, 2025Updated last year
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters …☆77Jan 8, 2026Updated last month
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆335Apr 25, 2025Updated 10 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆199Dec 18, 2022Updated 3 years ago
- Fuzzy matching and more functionality for spaCy.☆259Jul 6, 2024Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆105Apr 23, 2024Updated last year
- PYthon Automated Term Extraction☆318Feb 8, 2023Updated 3 years ago
- Dataframe Integration with spaCy.☆103Mar 12, 2021Updated 4 years ago
- Active Learning for Text Classification in Python☆639Feb 1, 2026Updated last month
- Generate reports for spaCy models.☆29May 27, 2022Updated 3 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆110Apr 16, 2024Updated last year
- spaCy pipeline object for negating concepts in text☆282Jun 16, 2025Updated 8 months ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆418Jan 31, 2025Updated last year
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Dec 8, 2021Updated 4 years ago
- REMERGE - Multi-Word Expression discovery algorithm☆14Feb 21, 2026Updated last week
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆100Dec 26, 2024Updated last year
- Doubt your data, find bad labels.☆517Jul 15, 2024Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Mar 18, 2024Updated last year
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Jan 17, 2023Updated 3 years ago
- Information extraction from English and German texts based on predicate logic☆394Jul 8, 2022Updated 3 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Feb 1, 2026Updated last month
- A spaCy wrapper for DBpedia Spotlight☆113Mar 24, 2023Updated 2 years ago
- Fuzzy string matching, grouping, and evaluation.☆791Jul 10, 2025Updated 7 months ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Aug 21, 2025Updated 6 months ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆87Oct 6, 2022Updated 3 years ago
- Simple customizable pipeline tool for anonymizing Danish text.☆11Sep 19, 2024Updated last year
- Graph Data Science: an abstraction layer in Python for building knowledge graphs, integrated with popular graph libraries – atop Pandas, …☆675Jan 25, 2026Updated last month
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago