A Python library for calculating a large variety of metrics from text
☆363 · May 5, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for TextDescriptives
Users that are interested in TextDescriptives are comparing it to the libraries listed below.
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆156 · May 24, 2024 · Updated last year
- DaCy: The State of the Art Danish NLP pipeline using SpaCy ☆103 · Dec 26, 2024 · Updated last year
- Active Learning for Text Classification in Python ☆643 · Updated this week
- Implementation of the ClausIE information extraction system for python+spacy ☆228 · Aug 8, 2022 · Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti… ☆244 · Jun 19, 2023 · Updated 2 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 · Apr 15, 2024 · Updated 2 years ago
- spaCy pipeline object for negating concepts in text ☆282 · Apr 20, 2026 · Updated last month
- PYthon Automated Term Extraction ☆317 · Feb 8, 2023 · Updated 3 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆927 · Sep 2, 2024 · Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆221 · Jan 20, 2025 · Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy. ☆117 · Oct 20, 2025 · Updated 7 months ago
- Bag of, not words, but tricks! ☆68 · Oct 31, 2023 · Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆172 · Nov 7, 2022 · Updated 3 years ago
- Doubt your data, find bad labels. ☆516 · Jul 15, 2024 · Updated last year
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… ☆1,268 · Jul 24, 2025 · Updated 9 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆403 · Jul 30, 2021 · Updated 4 years ago
- Combining encoder-based language models ☆11 · Nov 11, 2021 · Updated 4 years ago
- [EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment ☆132 · Mar 7, 2023 · Updated 3 years ago
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters … ☆77 · Jan 8, 2026 · Updated 4 months ago
- Zero and Few shot named entity & relationships recognition ☆402 · Sep 17, 2025 · Updated 8 months ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer… ☆396 · May 11, 2023 · Updated 3 years ago
- Python package to calculate readability statistics of a text object - paragraphs, sentences, articles. ☆1,368 · Feb 18, 2026 · Updated 3 months ago
- The website for Danish Foundation Models, a project for training foundational Danish language models. ☆81 · Apr 30, 2026 · Updated 3 weeks ago
- Fuzzy string matching, grouping, and evaluation. ☆796 · Jul 10, 2025 · Updated 10 months ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking! ☆480 · Feb 6, 2023 · Updated 3 years ago
- SpanMarker for Named Entity Recognition ☆473 · Apr 10, 2026 · Updated last month
- ✔️Contextual word checker for better suggestions (not actively maintained) ☆419 · Jan 31, 2025 · Updated last year
- Converting irregularly spaced time series, such as electronic health records, into dataframes for tabular classification. ☆20 · Jun 17, 2025 · Updated 11 months ago
- A Python library aimed at dissecting and augmenting NER training data. ☆60 · May 11, 2023 · Updated 3 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy ☆341 · Apr 25, 2025 · Updated last year
- Few-shot Named Entity Recognition ☆119 · Mar 30, 2022 · Updated 4 years ago
- A Scandinavian Benchmark for sentence embeddings ☆45 · Dec 5, 2025 · Updated 5 months ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy. ☆112 · Apr 16, 2024 · Updated 2 years ago
- Dataframe Integration with spaCy. ☆103 · Mar 12, 2021 · Updated 5 years ago
- Efficient few-shot learning with Sentence Transformers ☆2,735 · Apr 17, 2026 · Updated last month
- just a bunch of useful embeddings for scikit-learn pipelines ☆526 · Feb 12, 2026 · Updated 3 months ago
- Confection: the sweetest config system for Python ☆193 · Mar 27, 2026 · Updated last month
- 🧹 Python package for text cleaning ☆1,014 · Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,975 · Apr 27, 2026 · Updated 3 weeks ago