ArchiveBox / readability-extractor
Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.
☆40Updated 6 months ago
Alternatives and similar repositories for readability-extractor:
Users that are interested in readability-extractor are comparing it to the libraries listed below
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch…☆19Updated last year
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆18Updated 6 months ago
- Awesome links related to RSS, ATOM, and Syndication formats.☆56Updated 8 months ago
- something like a public wiki, - a place to store notes, ideas, blogposts, photography, or writing☆19Updated this week
- Host-free RSS reader in your browser.☆16Updated last year
- Export your Github activity: events, repositories, stars, etc.☆53Updated last year
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.☆28Updated 6 months ago
- Browser extension to create a markdown link with quoted text☆21Updated 2 years ago
- The ArchiveWeb.page Site☆30Updated 4 months ago
- rsstodolist Firefox and Chrome addon (using Web Extension API)☆13Updated 2 years ago
- Where knowledge grows.☆17Updated 5 months ago
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myse…☆21Updated last year
- Derived from https://github.com/telerik/kendo-ui-core☆13Updated 2 years ago
- Personal news feed: search for results on Reddit/Pinboard/Twitter/Hackernews and read as RSS☆31Updated 2 weeks ago
- Centralize, view, edit, label and organize collections of your favorite URLs 🔗 📙☆38Updated 2 years ago
- Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.☆16Updated last week
- Nodejs server for local backups of memex.☆22Updated 2 years ago
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing…☆73Updated last week
- [Moved to https://github.com/standardnotes/app] A code editor for Standard Notes with syntax highlighting support for over 120 programmin…☆13Updated 2 years ago
- Encapsulate dom-anchor-text-quote and dom-anchor-text-position for use in browser scripts☆10Updated 3 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆53Updated 2 months ago
- linkbak is a web page archiver : it reads a list of links and dumps the corresponding pages in HTML and PDF.☆14Updated 2 years ago
- Oneplaybook app helps you capture, organize and share knowledge better with TiddlyWiki.☆15Updated 3 years ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆53Updated 7 months ago
- A set of scripts that connect various apps to Raindrop.io☆17Updated last month
- Uroute: Route URLs to configured browsers☆32Updated 2 years ago
- A general-purpose personal data management system☆13Updated last year
- ☆10Updated 3 years ago
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆16Updated last month
- Rename Hypothesis tags☆15Updated 5 years ago