ArchiveBox/readability-extractor

ArchiveBox / readability-extractor

JavaScript/Node.js wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.

☆43

Alternatives and similar repositories for readability-extractor

Users who are interested in readability-extractor are comparing it to the libraries listed below.

ArchiveBox / DigestBox
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch…
☆21 · Feb 2, 2024 · Updated 2 years ago
ArchiveBox / internet-archiving-talk
🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.
☆15 · Oct 19, 2020 · Updated 5 years ago
aurelg / linkbak
linkbak is a web page archiver: it reads a list of links and dumps the corresponding pages in HTML and PDF.
☆13 · Dec 8, 2022 · Updated 3 years ago
pirate / nicksweeting.com
The code for my website, including the game of life and other easter eggs.
☆20 · Nov 11, 2024 · Updated last year
RealMelkor / Vgmi
Gemini client with vim-like keybindings
☆16 · May 26, 2026 · Updated last month
uriel1998 / muna
Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…
☆19 · Nov 25, 2025 · Updated 7 months ago
lpil / lww-register-crdt
The last-write-wins register CRDT
☆18 · Nov 10, 2024 · Updated last year
MyOS-ArchLinux / vids
🔍 🔘 ⏯️ 🔁 - search for videos to play from youtube.com and other platforms...
☆17 · Sep 9, 2021 · Updated 4 years ago
aarmea / readability-scrape
Retrieve simplified versions of webpages, powered by Mozilla's Readability.js
☆15 · Oct 14, 2018 · Updated 7 years ago
OpenStarscape / starscape-server
This project has moved
☆11 · Sep 9, 2023 · Updated 2 years ago
paulo1er / WorkFlowy-Export
☆13 · Mar 12, 2021 · Updated 5 years ago
Dotz0cat / walld
A wallpaper daemon
☆23 · May 6, 2025 · Updated last year
trizen / lbry-viewer
Experimental Linux client for LBRY/Odysee.
☆17 · Jul 27, 2025 · Updated 11 months ago
ttscoff / TextBuddyScripts
☆14 · Dec 17, 2021 · Updated 4 years ago
oxalica / ghoti-shell
☆15 · Apr 8, 2025 · Updated last year
ArchiveBox / abx-spec-behaviors
🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…
☆20 · Jul 11, 2025 · Updated last year
jupyterlab / jupyterlab-mp4
Example mimerenderer extension for showing mp4 videos.
☆17 · Aug 7, 2023 · Updated 2 years ago
namelivia / fitbit-http-php
PHP SDK for accessing the Fitbit HTTP API
☆10 · Dec 26, 2022 · Updated 3 years ago
sooheon / hangul-utils
A Clojure library for deconstructing Korean unicode syllable characters into alphabet characters
☆10 · Nov 22, 2021 · Updated 4 years ago
nodiscc / shaarchiver
[archived] Archive your Firefox, Shaarli or delicious bookmarks
☆56 · Apr 4, 2023 · Updated 3 years ago
noctuid / gallery-dl-view
Mpv integration with gallery-dl
☆26 · Mar 19, 2026 · Updated 4 months ago
mannau / boilerpipeR
Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)
☆21 · May 19, 2021 · Updated 5 years ago
nickjevershed / Time-serious
Automated journalism from data and time series analysis
☆11 · Mar 7, 2016 · Updated 10 years ago
catesandrew / recipe-parser
Scrub recipes from popular cooking websites
☆18 · Feb 23, 2014 · Updated 12 years ago
gildas-lormeau / SingleFile-Archives
Pages saved with SingleFile
☆13 · Mar 16, 2024 · Updated 2 years ago
karlicoss / axol
Personal news feed: search for results on Reddit/Pinboard/Twitter/Hackernews and read as RSS
☆35 · Jun 29, 2026 · Updated 2 weeks ago
webclipper / ecosystem
extensions of web clipper
☆11 · Dec 9, 2022 · Updated 3 years ago
ArchiveBox / good-karma-kit
😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, B…
☆395 · May 19, 2025 · Updated last year
Jelmerro / Garlmap
Garlmap is the Gapless Almighty Rule-based Logical Mpv Audio Player
☆15 · Jul 12, 2026 · Updated last week
Rouji / ssh2p
SSH to POST. For making weird, SSH-based pastebins.
☆22 · Jun 29, 2021 · Updated 5 years ago
liamg / darktile-themes
A repository of themes for https://github.com/liamg/darktile
☆10 · Jul 30, 2021 · Updated 4 years ago
pigmonkey / goesimage
Download the latest image from a NOAA Geostationary Operational Environmental Satellite and set it as the desktop background
☆13 · Feb 17, 2023 · Updated 3 years ago
pirate / internet-archiving-talk
🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.
☆57 · Aug 15, 2024 · Updated last year
x42 / sound-gambit
Audio File Peak Limiter
☆19 · Mar 20, 2022 · Updated 4 years ago
bitsgalore / contextactions
Collection of Caja utility scripts for MATE desktop
☆13 · Sep 26, 2018 · Updated 7 years ago
JabRef / scimappr
Scientific Mind Mapping
☆15 · Jan 25, 2018 · Updated 8 years ago
chenkie / try-graphql
☆10 · Jun 19, 2020 · Updated 6 years ago
pirate / awesome-selfhosted
This is a list of Free Software network services and web applications which can be hosted locally. Selfhosting is the process of locally …
☆16 · Sep 30, 2018 · Updated 7 years ago
xdamman / readability-cli
Read any web page from the command line using readability.js
☆13 · Jul 15, 2020 · Updated 6 years ago