burrsettles / readabilityLinks
Text readability metrics in Python.
☆11Updated 12 years ago
Alternatives and similar repositories for readability
Users that are interested in readability are comparing it to the libraries listed below
Sorting:
- Fast Word Clustering Software☆78Updated 7 months ago
- framework for doing NER and other types of entity recognition, in Python☆68Updated 3 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 7 years ago
- ☆70Updated 2 years ago
- An index data structure for approximate string search.☆23Updated 6 years ago
- Extractors whose input is a chunked sentence. Includes Relnoun, Nesty, and a scala interface for ReVerb.☆28Updated 7 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 7 months ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 3 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".☆63Updated 9 years ago
- WordNet Domains, WordNet Affect and SentiWords☆47Updated 9 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆154Updated 10 months ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 9 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- A web demo for visualizing Semafor parses☆29Updated 7 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 6 years ago
- Python package for stylometry☆63Updated 4 years ago
- Language detection extension for spaCy 2.0+☆113Updated 6 years ago
- bin files☆13Updated 7 months ago
- ☆59Updated 10 years ago
- ☆30Updated 3 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- Fast structured perceptron sequential labeler☆15Updated 9 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases☆45Updated 5 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆65Updated last year
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…☆19Updated 5 years ago
- Software for the paper "Gender and Lexical Variation in Social Media" with David Bamman and Tyler Schnoebelen☆17Updated 9 years ago