aarmea / readability-scrapeLinks
Retrieve simplified versions of webpages, powered by Mozilla's Readability.js
☆15Updated 7 years ago
Alternatives and similar repositories for readability-scrape
Users that are interested in readability-scrape are comparing it to the libraries listed below
Sorting:
- DIY Atom feeds in times of social media and paywalls☆85Updated this week
- Bundle external assets in a HTML file to distribute a stand-alone HTML document.☆42Updated 7 months ago
- Find rss, atom, xml, and rdf feeds on webpages☆31Updated 3 months ago
- Tool to index and serve HTML files. Powered by Datasette.☆111Updated 3 years ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆42Updated last year
- Tool for real-time scraping of news articles.☆39Updated 6 years ago
- ☆78Updated 3 years ago
- 📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs☆70Updated last year
- An adaptation of rss2mail that uses IMAP directly☆86Updated 4 years ago
- Powerful command-line tool for slicing & dicing HTML☆38Updated 3 years ago
- Create a SQLite database containing data from your Pocket account☆107Updated 2 years ago
- 🦛 scrapes websites and generates rss feeds☆54Updated 11 months ago
- A dockerized, queued high fidelity web archiver based on Squidwarc☆61Updated last year
- A CLI for Mozilla Readability. Get clean, uncluttered, ready-to-read HTML from any webpage!☆53Updated 2 years ago
- Build a search index across content from multiple SQLite database tables and run faceted searches against it using Datasette☆199Updated 4 years ago
- Add browser pages to your local YACY index☆15Updated 2 years ago
- A simple headless browser☆76Updated 2 years ago
- Plugin based RSS feed generator for sites that don't offer any. Serves RSS, Atom and JSON Feeds.☆89Updated 4 years ago
- Personal WayBack Machine☆129Updated 6 years ago
- Web RSS aggregator and reader compatible with the Fever API☆147Updated last year
- 📰 Build RSS 2.0 feeds from websites (and JSON APIs) automatically or with a few CSS selectors.☆137Updated last month
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head☆173Updated 5 years ago
- Tools for personal analytics using SQLite and Datasette☆128Updated last year
- A Document Managment System (DMS) using an IMAP Server as storage and for querying.☆14Updated 10 years ago
- Save data from Google Takeout to a SQLite database☆118Updated 2 years ago
- Google News RSS as OPML☆25Updated 7 years ago
- One-Click User Instigated Preservation☆129Updated 7 years ago
- What if cron and systemd had a baby?☆58Updated last month
- Press Cmd + Alt + I☆49Updated 2 months ago
- A journal, of sorts.☆13Updated 4 years ago