ArchiveBox / readability-extractor
Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.
☆37Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for readability-extractor
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch…☆15Updated 9 months ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆15Updated last month
- something like a public wiki, - a place to store notes, ideas, blogposts, photography, or writing☆17Updated this week
- Export your Github activity: events, repositories, stars, etc.☆48Updated last year
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing…☆41Updated this week
- Proxies third-party PDF files and HTML pages with the Hypothesis client embedded, so you can annotate them☆20Updated this week
- Bookmarklet for multicolumn reader mode.☆15Updated 7 months ago
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myse…☆19Updated last year
- Browser extension to create a markdown link with quoted text☆21Updated 2 years ago
- Awesome links related to RSS, ATOM, and Syndication formats.☆49Updated 4 months ago
- A collection of browser bookmarklets☆12Updated 2 years ago
- Extend Firefox's history capabilities with browsing stats, improved searching and additional features☆24Updated 10 months ago
- Collaborative cheatsheets for console commands (tldr project) now in your Browser!☆13Updated 2 years ago
- Encapsulate dom-anchor-text-quote and dom-anchor-text-position for use in browser scripts☆10Updated 3 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆49Updated last month
- rsstodolist Firefox and Chrome addon (using Web Extension API)☆13Updated last year
- Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.☆14Updated this week
- Command-line program for organizing and managing ebook collections. It is a Python port from the original shell scripts ebook-tools☆22Updated 6 months ago
- Chrome Extension for Hacker News and Reddit Links☆26Updated last year
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆13Updated last month
- 💫 A lightweight browser extension to jump to various external bookmarks from the address bar.☆22Updated this week
- Derived from https://github.com/telerik/kendo-ui-core☆12Updated last year
- knowledgebase sharing experiment☆20Updated 4 years ago
- A collection of curated home built packages for the cross-platform text expander Espanso☆35Updated 4 months ago
- linkbak is a web page archiver : it reads a list of links and dumps the corresponding pages in HTML and PDF.☆14Updated last year
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆50Updated 3 months ago
- Import data from Google Takeout to search and analyze☆16Updated last year
- Encode/decode binary data over a live streaming video in real time.☆13Updated last year
- Personal news feed: search for results on Reddit/Pinboard/Twitter/Hackernews and read as RSS☆29Updated 2 months ago