bibanon / webcache-scraper
The Bibliotheca Anonoma's own Bing Cache and Google Cache scraper scripts. Unlike most of the other ones you've seen, these actually work.
☆28Updated 7 years ago
Alternatives and similar repositories for webcache-scraper:
Users that are interested in webcache-scraper are comparing it to the libraries listed below
- Grabbing all news.☆62Updated 5 years ago
- Recover lost websites from the Web Infrastructure☆88Updated 4 years ago
- 💾 YouTube video metadata archiver written in Golang☆19Updated 5 years ago
- 🖼 Image Extraction Tool☆16Updated 3 years ago
- Download porn videos (Python)☆12Updated 6 years ago
- dosage is a comic strip downloader and archiver☆51Updated 5 years ago
- Extract all internal and external links from a URL in Python.☆13Updated last year
- ☆26Updated 11 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆46Updated 6 years ago
- A python tool to extract data types such as email, URL, domains and phone numbers.☆37Updated 11 years ago
- A Chrome extension that creates a personalized map of the web based on the user's browsing history.☆24Updated 11 years ago
- A scrapy spider to extract post, thread, and user information from a vBulletin forum to a MongoDB database.☆31Updated 9 years ago
- Google SEO scraper for "allintitle:keyword" queries.☆23Updated 10 years ago
- Web data extraction tool implemented as chrome extension☆28Updated 4 years ago
- Web data extraction tool implemented as chrome extension with much more features☆46Updated 6 years ago
- Easily archive important Reddit post threads onto your computer☆58Updated 2 years ago
- Python and selenium based (mobile) Facebook groups scraper, independent of obfuscated css selectors.☆11Updated 4 years ago
- Generates new porn descriptions based on an edited dataset of xhamster video descriptions uploaded between 2007-2016.☆54Updated last month
- WIP tag-based file organizer & search☆38Updated last year
- data with similar subreddits graph☆45Updated last year
- Provide what you expected from Instagram.☆22Updated 3 years ago
- Firefox Web Extension to save Facebook posts as images☆20Updated 3 years ago
- Python tool to monitor RSS feeds and download the linked content.☆14Updated 7 years ago
- An authorship attribution project with particular emphasis on Twitter analysis☆16Updated 3 years ago
- Unofficial Anna's Archive API written in JS.☆38Updated last year
- API - extract a list of keywords from a text.☆18Updated 7 years ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆23Updated 4 years ago
- Deviant Spy is a native advertising (RevContent) spy tool☆31Updated 6 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆55Updated last year
- Save a bunch of web pages as a self-contained, compressed archive file for offline storage and sharing.☆35Updated 12 years ago