A Python library for calculating a large variety of metrics from text
☆366May 5, 2026Updated last month
Alternatives and similar repositories for TextDescriptives
Users that are interested in TextDescriptives are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156May 24, 2024Updated 2 years ago
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆103May 23, 2026Updated 2 weeks ago
- Active Learning for Text Classification in Python☆644May 24, 2026Updated 2 weeks ago
- Implementation of the ClausIE information extraction system for python+spacy☆229Aug 8, 2022Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Jun 19, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Apr 15, 2024Updated 2 years ago
- spaCy pipeline object for negating concepts in text☆282Apr 20, 2026Updated last month
- PYthon Automated Term Extraction☆317Feb 8, 2023Updated 3 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆221Jan 20, 2025Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆117Oct 20, 2025Updated 7 months ago
- Bag of, not words, but tricks!☆68Oct 31, 2023Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆172Nov 7, 2022Updated 3 years ago
- Doubt your data, find bad labels.☆516Jul 15, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,270Jul 24, 2025Updated 10 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆403Jul 30, 2021Updated 4 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- [EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment☆132Mar 7, 2023Updated 3 years ago
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters …☆77May 29, 2026Updated last week
- Zero and Few shot named entity & relationships recognition☆401Sep 17, 2025Updated 8 months ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 3 years ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,372Feb 18, 2026Updated 3 months ago
- The website for Danish Foundation Models, a project for training foundational Danish language model.☆81Apr 30, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Fuzzy string matching, grouping, and evaluation.☆798Jul 10, 2025Updated 11 months ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆480Feb 6, 2023Updated 3 years ago
- SpanMarker for Named Entity Recognition☆476Apr 10, 2026Updated 2 months ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆419Jan 31, 2025Updated last year
- Converting irregularly spaced time series, such as eletronic health records, into dataframes for tabular classification.☆20Jun 17, 2025Updated 11 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆60May 11, 2023Updated 3 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆341Apr 25, 2025Updated last year
- Few-shot Named Entity Recognition☆119Mar 30, 2022Updated 4 years ago
- A Scandinavian Benchmark for sentence embeddings☆45Dec 5, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆111Apr 16, 2024Updated 2 years ago
- Dataframe Integration with spaCy.☆103Mar 12, 2021Updated 5 years ago
- Efficient few-shot learning with Sentence Transformers☆2,743May 26, 2026Updated 2 weeks ago
- just a bunch of useful embeddings for scikit-learn pipelines☆526Feb 12, 2026Updated 3 months ago
- Confection: the sweetest config system for Python☆193Mar 27, 2026Updated 2 months ago
- 🧹 Python package for text cleaning☆1,020May 15, 2026Updated 3 weeks ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,996Updated this week