andreasvc / readability
Measure the readability of a given text using surface characteristics
☆79 · Updated last month
Alternatives and similar repositories for readability:
Users who are interested in readability are comparing it to the libraries listed below.
- Linguistic and stylistic complexity measures for (literary) texts ☆79 · Updated last year
- A module to compute textual lexical richness (aka lexical diversity). ☆103 · Updated last year
- Sentence transformers models for spaCy ☆107 · Updated 2 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line ☆124 · Updated 2 months ago
- 🧪 Cutting-edge experimental spaCy components and features ☆97 · Updated 10 months ago
- 📂 Additional lookup tables and data resources for spaCy ☆105 · Updated last month
- Cleans Reddit Text Data ☆81 · Updated 4 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆80 · Updated 8 months ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the Incorporated… ☆26 · Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions ☆54 · Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆37 · Updated 3 years ago
- A Python module for word inflections designed for use with spaCy. ☆92 · Updated 5 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆202 · Updated 3 years ago
- Language detection using spaCy and fastText ☆55 · Updated last year
- In-the-wild extraction of entities that are found using Flair and displayed using a very elegant front-end. ☆71 · Updated 2 years ago
- LanguageTool-style grammar handling with spaCy 2.0 ☆42 · Updated 6 years ago
- A Python module for English lemmatization and inflection. ☆265 · Updated last year
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆65 · Updated 2 years ago
- ☆168 · Updated 9 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 · Updated 11 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆151 · Updated 9 months ago
- Searching in-memory corpus with Corpus Query Language (CQL) ☆19 · Updated 3 months ago
- A Python true casing utility that restores case information for texts ☆88 · Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and software ☆66 · Updated last year
- A fully customisable language detection pipeline for spaCy ☆92 · Updated 5 years ago
- REMERGE - Multi-Word Expression discovery algorithm ☆14 · Updated 2 years ago
- Legal document classification with EuroVoc descriptors on 22 languages. ☆25 · Updated last year
- spaCy pipeline component for adding text readability meta data to Doc objects. ☆56 · Updated 5 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora. ☆29 · Updated 6 years ago