bibanon / webcache-scraper
The Bibliotheca Anonoma's own Bing Cache and Google Cache scraper scripts. Unlike most of the other ones you've seen, these actually work.
β27Updated 6 years ago
Related projects β
Alternatives and complementary repositories for webcache-scraper
- Grabbing all news.β62Updated 4 years ago
- πΎ YouTube video metadata archiver written in Golangβ19Updated 4 years ago
- πΌ Image Extraction Toolβ16Updated 3 years ago
- A Chrome extension that creates a personalized map of the web based on the user's browsing history.β23Updated 11 years ago
- Tool and library for handling Web ARChive (WARC) files.β150Updated last month
- A Memento Aggregator CLI and Server in Goβ57Updated 6 months ago
- Web archiving using Google Chromeβ42Updated 4 years ago
- β36Updated last year
- Nondestructive warc-in-tar to warc conversionβ25Updated 11 years ago
- URLTeam's second generation of URL shortener archiving toolsβ72Updated 2 weeks ago
- Reverse image search extension for Google Chrome.β69Updated 2 months ago
- A commandline tool and Python library for archiving data from Facebook using the Graph API.β77Updated 6 years ago
- NSFW reverse image search for redditβ59Updated 9 years ago
- β26Updated 11 years ago
- β32Updated 8 years ago
- A collection of tools for archiving and analysing the internet.β70Updated 2 years ago
- A simple Python wrapper and command-line interface for archive.orgβs "Save Page Now" capturing serviceβ168Updated last month
- Bot for operating snscrape in #archivebot on efnetβ10Updated 4 years ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)β152Updated 4 years ago
- Provide what you expected from Instagram.β22Updated 3 years ago
- Recover lost websites from the Web Infrastructureβ85Updated 3 years ago
- Converts WARC files to static HTMLβ39Updated 4 months ago
- Uploads items into the Internet Archive after they have been downloaded with youtube-dlβ15Updated 9 years ago
- Deep Zoom Image Downloaderβ18Updated 6 months ago
- Serving content from a WARCβ60Updated 11 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.β42Updated 6 years ago
- An in-progress collection of sites that spread clickbait, hoaxes, propaganda and disinformation.β96Updated last year
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a headβ169Updated 4 years ago
- Batch reverse image search toolβ22Updated 5 years ago