andreasvc / readability
Measure the readability of a given text using surface characteristics
☆76Updated 2 weeks ago
Alternatives and similar repositories for readability:
Users that are interested in readability are comparing it to the libraries listed below
- Linguistic and stylistic complexity measures for (literary) texts☆79Updated last year
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 5 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- Extract dates from text☆64Updated 4 years ago
- 📂 Additional lookup tables and data resources for spaCy☆100Updated 2 weeks ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆71Updated last year
- Text tokenization and sentence segmentation (segtok v2)☆202Updated 2 years ago
- A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python☆100Updated last month
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- A compound word splitter for Python☆48Updated 3 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆148Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆122Updated last month
- Sentence transformers models for SpaCy☆107Updated last year
- A module to compute textual lexical richness (aka lexical diversity).☆99Updated last year
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Updated 6 years ago
- a collection of functions that measure the readability of a given body of text☆191Updated 7 years ago
- Cleans Reddit Text Data☆81Updated 4 years ago
- spaCy + UDPipe☆160Updated 2 years ago
- The project proposes a framework to apply topic models on a text-corpus and eventually topic labels on the generated topics.☆35Updated 9 months ago
- A multilingual lexicon of words to hurt.☆82Updated 3 months ago
- An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.☆31Updated 5 months ago
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- Passive/Active sentence Transformer☆28Updated 6 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆108Updated last year
- Regular spotlights of underrated NLP and Data Science GitHub repositories☆35Updated 4 years ago