aarmea / readability-scrape
Retrieve simplified versions of webpages, powered by Mozilla's Readability.js
☆15 · Updated 6 years ago
Alternatives and similar repositories for readability-scrape:
Users who are interested in readability-scrape are comparing it to the libraries listed below.
- DIY Atom feeds in times of social media and paywalls ☆83 · Updated 10 months ago
- A dockerized, queued, high-fidelity web archiver based on Squidwarc ☆58 · Updated 8 months ago
- Tool for real-time scraping of news articles. ☆39 · Updated 5 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX) ☆24 · Updated 7 years ago
- Generate an RSS feed from markup content and metadata ☆32 · Updated last year
- ☆78 · Updated 2 years ago
- Extract clean(er), readable text from web pages via Mercury Web Parser. ☆118 · Updated 8 months ago
- Tag-based bookmark manager inspired by delicious and Pinboard ☆34 · Updated 2 years ago
- Web archiving using Google Chrome ☆44 · Updated 5 years ago
- The Bibliotheca Anonoma's own Bing Cache and Google Cache scraper scripts. Unlike most of the other ones you've seen, these actually work… ☆28 · Updated 7 years ago
- Tool to index and serve HTML files. Powered by Datasette. ☆96 · Updated 3 years ago
- Script that generates an RSS feed for websites that don't have one ☆31 · Updated 6 years ago
- 🦛 Scrapes websites and generates RSS feeds ☆53 · Updated last month
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service ☆177 · Updated 5 months ago
- A Python application for converting YouTube playlists and channels into podcast RSS feeds. ☆35 · Updated last year
- Bundle external assets in an HTML file to distribute a stand-alone HTML document. ☆30 · Updated 2 years ago
- Tidies up Bash command history by giving good control over erasing certain lines. ☆9 · Updated 2 years ago
- Grub is an AI-powered Web crawler. ☆19 · Updated 2 years ago
- Add browser pages to your local YaCy index ☆15 · Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41 · Updated 5 years ago
- Espial is an open-source, web-based bookmarking server. ☆39 · Updated 2 years ago
- Bookmarks made better ☆19 · Updated last month
- Scrapy project for collecting hyperlinks from RSS feeds using feedly's Streams API ☆20 · Updated 2 years ago
- The distraction-free writing tool I use for all my writing. ☆15 · Updated 3 years ago
- A lightweight feed reader that runs in your browser, with no backend ☆51 · Updated 2 weeks ago
- An ambient noise mixer ☆26 · Updated last year
- Create a static HTML/CSS image gallery from a bunch of images. ☆21 · Updated 3 years ago
- Proxy-like server that will show you the DOM of a page after JS runs ☆38 · Updated last year
- Simple podcast downloader (podcatcher) ☆56 · Updated 3 weeks ago
- A server to collect & archive websites that also supports video downloads ☆86 · Updated 2 years ago