masukomi / arc90-readability
A copy of the original Arc90 repo with links to many of the current ports.
☆225 Updated 9 months ago
Alternatives and similar repositories for arc90-readability:
Users who are interested in arc90-readability are comparing it to the libraries listed below:
- Reworked https://www.readability.com/ parsing library (https://mercury.postlight.com/ is now the living alternative) ☆204 Updated 10 months ago
- Let's bring Readability to Chrome! ☆210 Updated 7 years ago
- Distills the DOM ☆658 Updated 3 years ago
- Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications. ☆389 Updated this week
- A fork of the Arc90 Labs Readability bookmarklet ☆82 Updated 6 years ago
- [abandoned] python port of arc90's readability bookmarklet ☆539 Updated 13 years ago
- An exercise in unsupervised machine learning: Extract Article's Text in HTml documents. ☆432 Updated last year
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs - http://gravity.com ☆343 Updated 5 years ago
- FeedHQ is a web-based feed reader ☆575 Updated 3 years ago
- Chrome extension to "Create WARC files from any webpage" ☆219 Updated last year
- Utilities for extracting notes from Notes.app. This repository is lightly maintained and mainly exists to serve as documentation and star… ☆235 Updated 2 years ago
- WarcDB: Web crawl data as SQLite databases. ☆398 Updated 8 months ago
- The next version of Tinderizer ☆95 Updated 6 years ago
- Automatically exported from code.google.com/p/boilerpipe ☆29 Updated 8 years ago
- The Hypothesis web-based annotation client. ☆652 Updated this week
- Custom, realtime RSS feeds for Hacker News ☆552 Updated last year
- C library for handling Kindle (MOBI) formats of ebook documents ☆437 Updated 5 months ago
- Extract data from websites using basic statistical magic ☆505 Updated 4 years ago
- Devtools extension, lets you locally edit files served from the web (based on mitmproxy). ☆395 Updated 2 years ago
- Extract clean(er), readable text from web pages via Mercury Web Parser. ☆118 Updated 8 months ago
- Tools for extracting data from Apple dictionary files (used by the Dictionary application on Mac). ☆117 Updated last year
- PDF2JSON is a conversion library based on XPDF (3.02) which can be used for high performance PDF page by page conversion to JSON and XML … ☆306 Updated 4 years ago
- A CLI for Mozilla Readability. Get clean, uncluttered, ready-to-read HTML from any webpage! ☆51 Updated last year
- Manually compare various readable web extractor libraries against different websites ☆21 Updated 2 years ago
- Hackpad is a web-based realtime wiki. ☆184 Updated last year
- A toolbox and web application for working with and presenting textual material from Shakespeare to Schopenhauer, and letters to literatur… ☆149 Updated 10 years ago
- Rewrite of fantastic Soulver application ☆140 Updated 13 years ago
- A collection of tools to help with the Google Reader shutdown. ☆470 Updated 6 years ago
- JavaScript Lemmatizer is a lemmatization library to retrieve a base form from an English inflected word. ☆66 Updated 3 years ago
- Webrecorder Desktop App! ☆205 Updated 4 years ago