JavaScript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.
☆42Sep 16, 2024Updated last year
Alternatives and similar repositories for readability-extractor
Users that are interested in readability-extractor are comparing it to the libraries listed below
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch…☆19Feb 2, 2024Updated 2 years ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆15Oct 19, 2020Updated 5 years ago
- linkbak is a web page archiver: it reads a list of links and dumps the corresponding pages in HTML and PDF.☆13Dec 8, 2022Updated 3 years ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆19Nov 25, 2025Updated 3 months ago
- The code for my website, including the game of life and other easter eggs.☆20Nov 11, 2024Updated last year
- Web archiver to bundle web page and its resources into single file☆15Jun 21, 2020Updated 5 years ago
- Markbox (FKA Bookmarky) is tag-based bookmarking tool inspired by Pinboard.in☆15Oct 9, 2022Updated 3 years ago
- An R Package for Building Books or Documents using pandoc☆10Aug 31, 2021Updated 4 years ago
- 🏫 List of awesome things for NodeSchool people☆13Mar 12, 2017Updated 9 years ago
- Read any web page from the command line using readability.js☆13Jul 15, 2020Updated 5 years ago
- Add your configs for tmux☆18Apr 3, 2022Updated 3 years ago
- 🔒📈 Host file tools written in rust.☆15Dec 23, 2025Updated 2 months ago
- Experimental Linux client for LBRY/Odysee.☆16Jul 27, 2025Updated 7 months ago
- [archived] Archive your Firefox, Shaarli or delicious bookmarks☆57Apr 4, 2023Updated 2 years ago
- Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution.☆16Aug 1, 2025Updated 7 months ago
- This project has moved☆11Sep 9, 2023Updated 2 years ago
- ☆13Mar 12, 2021Updated 5 years ago
- Similar to *script* without replay but with a mechanism to inject keystrokes in the slave's keyboard queue☆17Aug 1, 2021Updated 4 years ago
- ☆14Dec 17, 2021Updated 4 years ago
- 🔍 🔘 ⏯️ 🔁 - search for videos to play from youtube.com and other platforms...☆17Sep 9, 2021Updated 4 years ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆57Aug 15, 2024Updated last year
- Simple, strong and standardized keyed password storage.☆22May 17, 2018Updated 7 years ago
- Bookmark and archive webpages from the command line☆33Feb 6, 2019Updated 7 years ago
- A tool for communicating between web pages☆33Oct 16, 2024Updated last year
- Internal Pages module for Laravel Nova☆16Feb 7, 2020Updated 6 years ago
- Browser extension. Prevents annoying "video paused" dialogs from showing up☆14Jun 23, 2019Updated 6 years ago
- Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)☆21May 19, 2021Updated 4 years ago
- JavaScript wrapper library for Pushshift with Snoowrap support.☆10Jun 18, 2021Updated 4 years ago
- A Clojure library for deconstructing Korean unicode syllable characters into alphabet characters☆10Nov 22, 2021Updated 4 years ago
- Home of the official apt/deb package for Ubuntu/Debian-based systems.☆16Oct 5, 2024Updated last year
- Using zoxide in xxh. Zoxide is a faster way to navigate your filesystem.☆20Feb 6, 2023Updated 3 years ago
- Mpv integration with gallery-dl☆24Feb 4, 2023Updated 3 years ago
- 😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, B…☆389May 19, 2025Updated 10 months ago
- Automated journalism from data and time series analysis☆11Mar 7, 2016Updated 10 years ago
- Small, dependency-free, fast Nim package and CLI tool for removing tracking fields from URLs.☆11Mar 2, 2022Updated 4 years ago
- Scrub recipes from popular cooking websites☆18Feb 23, 2014Updated 12 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆58Aug 27, 2025Updated 6 months ago
- Personal news feed: search for results on Reddit/Pinboard/Twitter/Hackernews and read as RSS☆33Dec 13, 2025Updated 3 months ago
- Export all your github repositories to a form suitable for 'myrepos' to work with.☆54Dec 22, 2024Updated last year