ipeirotis / ReadabilityMetrics
A web service that computes a set of readability metrics for text. We currently support the following metrics: Automated Readability Index, Coleman-Liau Index, Flesch–Kincaid Grade Level, Flesch Reading Ease, Gunning-Fog Index, SMOG score, and SMOG Index.
☆71 Updated 2 years ago
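As a rough illustration of what two of the listed metrics compute, here is a minimal Python sketch of the Automated Readability Index and the Coleman-Liau Index, using their standard published formulas. It is not the service's own code: the function names (`text_counts`, `automated_readability_index`, `coleman_liau_index`) and the naive regex tokenization are assumptions made for this example, and a real implementation would likely split words and sentences more carefully.

```python
import re


def text_counts(text):
    """Return (letters, words, sentences) using naive regex tokenization.

    Hypothetical helper for this sketch: words are runs of letters, digits
    and apostrophes; sentences are counted by runs of '.', '!' or '?'.
    """
    words = re.findall(r"[A-Za-z0-9']+", text)
    # Count only alphanumeric characters, so apostrophes are excluded.
    letters = sum(ch.isalnum() for w in words for ch in w)
    # Treat text without terminal punctuation as a single sentence.
    sentences = len(re.findall(r"[.!?]+", text)) or 1
    return letters, len(words), sentences


def automated_readability_index(text):
    """ARI = 4.71 * (characters / words) + 0.5 * (words / sentences) - 21.43"""
    letters, words, sentences = text_counts(text)
    if words == 0:
        return 0.0
    return 4.71 * letters / words + 0.5 * words / sentences - 21.43


def coleman_liau_index(text):
    """CLI = 0.0588 * L - 0.296 * S - 15.8

    L is the average number of letters per 100 words,
    S is the average number of sentences per 100 words.
    """
    letters, words, sentences = text_counts(text)
    if words == 0:
        return 0.0
    letters_per_100 = letters / words * 100
    sentences_per_100 = sentences / words * 100
    return 0.0588 * letters_per_100 - 0.296 * sentences_per_100 - 15.8


if __name__ == "__main__":
    sample = "The cat sat on the mat. It was happy."
    print(automated_readability_index(sample))
    print(coleman_liau_index(sample))
```

Both indices need only character, word, and sentence counts, which is why they are shown here. The Flesch, Gunning-Fog, and SMOG measures also depend on syllable counts, which require a heuristic or a pronunciation dictionary.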
Related projects
Alternatives and complementary repositories for ReadabilityMetrics
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆57 Updated 9 months ago
- Algorithmic summarizer for RSS/Atom Feeds, Web Urls and arbitrary text. Codebase for the application deployed at http://tldrzr.herokuapp.… ☆53 Updated 8 years ago
- XTractor is an algorithmic text extractor from web pages written in Java. It builds upon the "commonly used web design practices" approac… ☆43 Updated 8 years ago
- Read natural language interactive queries. Great for bots. ☆18 Updated 8 years ago
- Extract opinion phrases from user reviews ☆62 Updated 10 years ago
- REST API for Text Summarization and Keywords Extraction ☆16 Updated 2 years ago
- Open Source implementation of Summly ☆47 Updated 7 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs). ☆90 Updated 3 years ago
- A Directory of Online Newspaper Sources for 70+ Languages ☆28 Updated 3 years ago
- Multilingual automatic text summarizer using statistical approach and extraction ☆33 Updated 5 years ago
- Performs multi document summarization. Includes a method to generate summaries: The method uses a sentence importance score calculator ba… ☆37 Updated 11 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts ☆59 Updated 12 years ago
- Information Extraction System can perform NLP tasks like Named Entity Recognition, Sentence Simplification, Relation Extraction etc. ☆27 Updated 10 years ago
- A pipeline for crawling of RSS feeds and the associated content. Demo at newsfeed.ijs.si. ☆21 Updated 12 years ago
- Meta-repository for the open-source version of the SUMMA Platform ☆16 Updated 7 months ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆145 Updated 9 months ago
- Normalized dataset of 70k job titles ☆63 Updated 8 months ago
- Sentiment Analysis in JavaScript using the AFINN Lexicon ☆27 Updated 5 years ago
- Virtual patent marking crawler at iproduct.epfl.ch ☆14 Updated 7 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate… ☆50 Updated 4 years ago
- API - extract a list of keywords from a text. ☆18 Updated 7 years ago
- Events and Situations Ontology ☆13 Updated 6 years ago
- Source real estate prices from the Common Crawl. ☆27 Updated 6 years ago
- ☆18 Updated 3 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python. ☆48 Updated 12 years ago
- A dynamically generated thesaurus using Syntactic N-grams parsed by Google Research. Rather than providing synonyms, this thesaurus provi… ☆16 Updated 11 years ago
- SVO extraction using NLTK ☆37 Updated 5 years ago
- White House Data Jam: Skill extraction from unstructured text. ☆27 Updated 10 years ago
- Extract postal addresses from the DOM ☆66 Updated 12 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆42 Updated 5 years ago