A Python library for calculating a large variety of metrics from text
☆360Jan 30, 2026Updated last month
Alternatives and similar repositories for TextDescriptives
Users that are interested in TextDescriptives are comparing it to the libraries listed below
Sorting:
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆157May 24, 2024Updated last year
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆100Dec 26, 2024Updated last year
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Jun 19, 2023Updated 2 years ago
- Active Learning for Text Classification in Python☆639Feb 1, 2026Updated last month
- Implementation of the ClausIE information extraction system for python+spacy☆226Aug 8, 2022Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆169Nov 7, 2022Updated 3 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Apr 15, 2024Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆220Jan 20, 2025Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆120Oct 20, 2025Updated 4 months ago
- PYthon Automated Term Extraction☆318Feb 8, 2023Updated 3 years ago
- spaCy pipeline object for negating concepts in text☆282Jun 16, 2025Updated 8 months ago
- Bag of, not words, but tricks!☆68Oct 31, 2023Updated 2 years ago
- Zero and Few shot named entity & relationships recognition☆401Sep 17, 2025Updated 5 months ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,265Jul 24, 2025Updated 7 months ago
- Doubt your data, find bad labels.☆517Jul 15, 2024Updated last year
- SpikeX - SpaCy Pipes for Knowledge Extraction☆403Jul 30, 2021Updated 4 years ago
- SpanMarker for Named Entity Recognition☆465Jan 8, 2025Updated last year
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters …☆77Jan 8, 2026Updated last month
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 2 years ago
- Fuzzy string matching, grouping, and evaluation.☆791Jul 10, 2025Updated 7 months ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆476Feb 6, 2023Updated 3 years ago
- [EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment☆132Mar 7, 2023Updated 2 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Jun 30, 2025Updated 8 months ago
- Efficient few-shot learning with Sentence Transformers☆2,688Dec 11, 2025Updated 2 months ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆418Jan 31, 2025Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆61May 11, 2023Updated 2 years ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,351Feb 18, 2026Updated last week
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆335Apr 25, 2025Updated 10 months ago
- ☆21Aug 24, 2023Updated 2 years ago
- 🧹 Python package for text cleaning☆1,002Jan 28, 2026Updated last month
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,875Updated this week
- The website for Danish Foundation Models, a project for training foundational Danish language model.☆81Jan 6, 2026Updated last month
- Few-shot Named Entity Recognition☆121Mar 30, 2022Updated 3 years ago
- Top2Vec learns jointly embedded topic, document and word vectors.☆3,106Nov 14, 2024Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆110Apr 16, 2024Updated last year
- 👑 spaCy building blocks and visualizers for Streamlit apps☆853Jul 29, 2024Updated last year
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Sep 10, 2024Updated last year