A Python library for calculating a large variety of metrics from text
☆363Mar 20, 2026Updated last month
Alternatives and similar repositories for TextDescriptives
Users that are interested in TextDescriptives are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156May 24, 2024Updated last year
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆101Dec 26, 2024Updated last year
- Active Learning for Text Classification in Python☆638Apr 17, 2026Updated 2 weeks ago
- Implementation of the ClausIE information extraction system for python+spacy☆228Aug 8, 2022Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Jun 19, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Apr 15, 2024Updated 2 years ago
- spaCy pipeline object for negating concepts in text☆282Apr 20, 2026Updated last week
- PYthon Automated Term Extraction☆317Feb 8, 2023Updated 3 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆220Jan 20, 2025Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆117Oct 20, 2025Updated 6 months ago
- Bag of, not words, but tricks!☆68Oct 31, 2023Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆170Nov 7, 2022Updated 3 years ago
- Doubt your data, find bad labels.☆516Jul 15, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,267Jul 24, 2025Updated 9 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆403Jul 30, 2021Updated 4 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- [EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment☆132Mar 7, 2023Updated 3 years ago
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters …☆77Jan 8, 2026Updated 3 months ago
- Zero and Few shot named entity & relationships recognition☆402Sep 17, 2025Updated 7 months ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 2 years ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,364Feb 18, 2026Updated 2 months ago
- The website for Danish Foundation Models, a project for training foundational Danish language model.☆80Apr 20, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Fuzzy string matching, grouping, and evaluation.☆794Jul 10, 2025Updated 9 months ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆474Feb 6, 2023Updated 3 years ago
- SpanMarker for Named Entity Recognition☆470Apr 10, 2026Updated 3 weeks ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆419Jan 31, 2025Updated last year
- Converting irregularly spaced time series, such as eletronic health records, into dataframes for tabular classification.☆20Jun 17, 2025Updated 10 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆61May 11, 2023Updated 2 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆339Apr 25, 2025Updated last year
- Few-shot Named Entity Recognition☆119Mar 30, 2022Updated 4 years ago
- A Scandinavian Benchmark for sentence embeddings☆46Dec 5, 2025Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆112Apr 16, 2024Updated 2 years ago
- Dataframe Integration with spaCy.☆103Mar 12, 2021Updated 5 years ago
- Efficient few-shot learning with Sentence Transformers☆2,724Apr 17, 2026Updated 2 weeks ago
- just a bunch of useful embeddings for scikit-learn pipelines☆524Feb 12, 2026Updated 2 months ago
- Confection: the sweetest config system for Python☆194Mar 27, 2026Updated last month
- 🧹 Python package for text cleaning☆1,010Jan 28, 2026Updated 3 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,948Apr 20, 2026Updated last week