dpapathanasiou / CleanScrape
A no-nonsense web scraping tool which removes the crap and preserves the content in epub and pdf formats.
☆41 Updated 9 years ago
Alternatives and similar repositories for CleanScrape:
Users who are interested in CleanScrape are comparing it to the libraries listed below.
- Update a local archive of your tweets. ☆50 Updated 12 years ago
- Scrapy Python crawler/spider with POST/GET login (handles CSRF), variable recursion depth, and optional saving to disk ☆55 Updated 6 years ago
- Python script that reads the iCloud tab database on macOS and pulls open tabs into an HTML Bookmark file. ☆12 Updated 5 years ago
- View browser history as a graph (Chrome extension) ☆42 Updated 7 months ago
- Automatically tag pinboard bookmarks based on page text ☆8 Updated 9 years ago
- Send starred github repos to pinboard ☆44 Updated last year
- Recon tool using Yatedo and Pipl ☆9 Updated 10 years ago
- A small command-line python script that creates a local backup of your Flickr data. It mirrors images, titles, description, tags, albums… ☆56 Updated last year
- A small python script for easy access to firefox bookmarks and browsing history ☆22 Updated 4 years ago
- Automatically chooses new tags for articles based on existing tagged items ☆27 Updated 7 years ago
- 🖱️ My sweet setup - OSX tools and tips for web developers and daily users ☆9 Updated last month
- File Filer; sort files into structured directory tree. Tree can be structured based on various designs such as date (file modification ti… ☆48 Updated 7 years ago
- Generate PGP keys with GnuPG, following best practices. ☆27 Updated 4 years ago
- Download subscriptions from YouTube ☆11 Updated 6 years ago
- Dropbox recovery tools ☆61 Updated 4 years ago
- NOISE creates "real-looking" text based upon a collection of reference texts, which can then be used in emails, tweets, web searches, IRC… ☆15 Updated 9 years ago
- A command line search multi-tool. ☆22 Updated 4 years ago
- Command-line tool to easily extract data from HTML or XML documents. Produces machine readable output. ☆31 Updated 5 years ago
- Parse OS X and iPhone Safari Internet History ☆19 Updated 10 years ago
- Simple bookmarking service ☆20 Updated last week
- Collection of Workflows for the iOS app Workflow (http://workflow.is) ☆10 Updated 9 years ago
- Perl script to detect the existence of transparent proxies ☆20 Updated 11 years ago
- A Chrome extension to keep your bookmarks in sync between browser and pinboard.in - Bookmark Folders can be any combination of tags, not … ☆11 Updated 10 years ago
- Back up the notes you’ve saved to Pinboard ☆88 Updated 3 weeks ago
- Every document published from the Snowden archive ☆67 Updated 9 years ago
- hacker defaults for OS X ☆17 Updated 9 years ago
- Simple, document-specific text snippets ☆12 Updated 10 years ago
- A monitoring device ☆79 Updated 10 years ago
- (W|H)all of lame - unencrypted password gathering under open wifi networks ☆32 Updated 7 years ago
- Maybe you're a guy a bit like me -- who watches a lot of series -- so I guess you already know that downloading the latest episodes of all … ☆21 Updated 9 years ago