dpapathanasiou / CleanScrape
A no-nonsense web scraping tool which removes the crap and preserves the content in epub and pdf formats.
☆41Updated 9 years ago
Alternatives and similar repositories for CleanScrape:
Users that are interested in CleanScrape are comparing it to the libraries listed below
- Update a local archive of your tweets.☆49Updated 12 years ago
- Automatically chooses new tags for articles based on existing tagged items☆27Updated 7 years ago
- Dropbox recovery tools☆61Updated 4 years ago
- Bot for operating snscrape in #archivebot on efnet☆10Updated 5 years ago
- Generate PGP keys with GnuPG, following best practices.☆27Updated 4 years ago
- Bookmark and archive webpages from the command line☆33Updated 6 years ago
- Personal news feed: search for results on Reddit/Pinboard/Twitter/Hackernews and read as RSS☆31Updated 2 weeks ago
- 🖱️ My sweet setup - OSX tools and tips for web developers and daily users☆9Updated 2 months ago
- Create a Reddit throwaway account with the click of a button! 🚮☆23Updated 4 years ago
- PageArchiver (previously called "Scrapbook for SingleFile") is a Chrome extension that helps to archive pages for offline reading☆86Updated 11 years ago
- Fast extraction of all external links from wikipedia☆10Updated 6 years ago
- Python script that reads the iCloud tab database on macOS and pulls open tabs into an HTML Bookmark file.☆12Updated 5 years ago
- A command line search multi-tool.☆22Updated 4 years ago
- Perl script to detect the existence of transparent proxies☆20Updated 11 years ago
- Search engine for subtitles☆10Updated 10 years ago
- A collection of scripts used in my Taskpaper 3 workflow☆11Updated 8 years ago
- Scripts for accessing and uploading to Flickr.☆35Updated 10 years ago
- An Awesome List for getting started with web archiving☆19Updated 6 years ago
- Send starred github repos to pinboard☆44Updated last year
- Pretty-print markdown☆31Updated 12 years ago
- Automatically tag pinboard bookmarks based on page text☆8Updated 9 years ago
- A collection of small scripts to do various things☆30Updated 9 years ago
- File Filer; sort files into structured directory tree. Tree can be structured based on various designs such as date (file modification ti…☆48Updated 7 years ago
- Parse OS X and iPhone Safari Internet History☆19Updated 10 years ago
- Search the internet from your terminal. Speed read your results. Terminal nirvana.☆21Updated 4 years ago
- Search, download, convert and send files directly to your kindle from Libgen in one place.☆23Updated 2 years ago
- Read/sync your IMAP mailboxes [Python]☆17Updated 4 years ago
- Extract list of results from search engines pages as CSV with a bookmarklet directly within the browser☆23Updated 3 weeks ago
- Extract known metadata from Apple's MacOS Photos library and export this metadata to EXIF/IPTC/XMP fields in the photo file For example: …☆42Updated 2 years ago
- self-hosted Lightweight News Reader with multi-user support☆40Updated last month