ipeirotis / ReadabilityMetrics
A web service that computes a set of readability metrics for text. We currently support the following metrics: Automated Readability Index, Coleman-Liau Index, Flesch–Kincaid Grade Level, Flesch Reading Ease, Gunning-Fog Index, SMOG score, and SMOG Index.
☆71Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ReadabilityMetrics
- Algorithmic summarizer for RSS/Atom Feeds, Web Urls and arbitrary text. Codebase for the application deployed at http://tldrzr.herokuapp.…☆53Updated 8 years ago
- 📖 Library that provides ways to read from and iterate through the Wikibase entities in a Wikibase Repository JSON dump☆71Updated 4 months ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated 9 months ago
- Fast Word Segmentation with Triangular Matrix☆77Updated 3 years ago
- Train your own Natural Language Processor from a browser 🤖 (Prototype)☆172Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- A dynamically generated thesaurus using Syntactic N-grams parsed by Google Research. Rather than providing synonyms, this thesaurus provi…☆16Updated 11 years ago
- NEWS: JATE2.0 Beta.11 Released, see details below.☆81Updated last year
- This page is a companion for the paper titled Towards Automatic Structuring and Semantic Indexing of Legal Documents☆28Updated 6 years ago
- XTractor is an algorithmic text extractor from web pages written in Java. It builds upon the "commonly used web design practices" approac…☆43Updated 8 years ago
- DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text.☆756Updated 6 years ago
- A python library detect and extract listing data from HTML page.☆109Updated 7 years ago
- Open Source implementation of Summly☆47Updated 7 years ago
- Extract a list of keywords from a website, sorted by word count.☆51Updated 8 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- Events and Situations Ontology☆13Updated 6 years ago
- ☆15Updated 12 years ago
- Information Extraction System can perform NLP tasks like Named Entity Recognition, Sentence Simplification, Relation Extraction etc.☆27Updated 10 years ago
- My Part of Speech Tagger☆42Updated 8 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs).☆90Updated 3 years ago
- Index Common Crawl archives in tabular format☆106Updated this week
- Bulk Copyscape is a script that utilizes Copyscape's API to by-pass the normal bulk upload queue, allowing you to quickly check websites …☆17Updated 2 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆50Updated 4 years ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆97Updated last month
- A Named-Entity Recogniser based on Grobid.☆49Updated 2 months ago
- A Python library to calculate the readability score of a text.☆134Updated 7 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆112Updated 8 years ago
- Client for Stanford Named Entity Reconginiton☆27Updated 6 years ago
- Automatic Document Summarizer using Bipartite HITS, Natural Language Processing (NLP)☆29Updated 12 years ago