ipeirotis / ReadabilityMetrics
A web service that computes a set of readability metrics for text. We currently support the following metrics: Automated Readability Index, Coleman-Liau Index, Flesch–Kincaid Grade Level, Flesch Reading Ease, Gunning-Fog Index, SMOG score, and SMOG Index.
☆71Updated 2 years ago
Alternatives and similar repositories for ReadabilityMetrics:
Users that are interested in ReadabilityMetrics are comparing it to the libraries listed below
- Algorithmic summarizer for RSS/Atom Feeds, Web Urls and arbitrary text. Codebase for the application deployed at http://tldrzr.herokuapp.…☆53Updated 8 years ago
- XTractor is an algorithmic text extractor from web pages written in Java. It builds upon the "commonly used web design practices" approac…☆43Updated 8 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆55Updated 11 months ago
- Index Common Crawl archives in tabular format☆110Updated 2 months ago
- Open Source implementation of Summly☆47Updated 8 years ago
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs - http://gravity.com☆343Updated 5 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆148Updated this week
- A dynamically generated thesaurus using Syntactic N-grams parsed by Google Research. Rather than providing synonyms, this thesaurus provi…☆16Updated 11 years ago
- Crawl-Anywhere - Web Crawler and document processing pipeline with Solr integration.☆96Updated 7 years ago
- Multilingual automatic text summarizer using statistical approach and extraction☆34Updated 5 years ago
- A Javascript Implementation of the Porter Stemmer☆96Updated 3 years ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.☆33Updated last year
- A huge list of stopwords collected from millions of news articles☆14Updated 7 years ago
- Automatic keyword extraction - no alchemy required!☆169Updated 9 years ago
- Automatic text summarization☆242Updated 6 years ago
- Google suggest API☆32Updated 8 years ago
- My Part of Speech Tagger☆42Updated 8 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆50Updated 4 years ago
- A Directory of Online Newspaper Sources for 70+ Languages☆32Updated 3 years ago
- Normalized dataset of 70k job titles☆64Updated 10 months ago
- spaCy REST API, wrapped in a Docker container.☆266Updated 2 years ago
- English stopwords collection☆155Updated 8 years ago
- A tool that analyzes content and suggests search engine optimization improvements.☆23Updated 8 years ago
- A Python canonicalizer to disambiguate and recognize known names from a poor quality data entry list.☆20Updated 8 years ago
- Wikipedia-based keyword extraction tool in Java☆21Updated 9 years ago
- A python library detect and extract listing data from HTML page.☆109Updated 7 years ago
- 100k+ topic labeled news articles published from thousands of news websites☆18Updated 4 years ago
- Index URLs in Common Crawl☆194Updated 7 years ago
- spaCy REST API, wrapped in a Docker container.☆16Updated 3 years ago