bibanon / webcache-scraperLinks
The Bibliotheca Anonoma's own Bing Cache and Google Cache scraper scripts. Unlike most of the other ones you've seen, these actually work.
☆28Updated 7 years ago
Alternatives and similar repositories for webcache-scraper
Users that are interested in webcache-scraper are comparing it to the libraries listed below
Sorting:
- Grabbing all news.☆62Updated 5 years ago
- Tool and library for handling Web ARChive (WARC) files.☆159Updated 7 months ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io☆38Updated 9 years ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)☆161Updated last week
- A collection of tools for archiving and analysing the internet.☆77Updated 2 years ago
- NSFW reverse image search for reddit☆63Updated 9 years ago
- Recover lost websites from the Web Infrastructure☆89Updated 4 years ago
- Easily archive important Reddit post threads onto your computer☆59Updated 2 years ago
- URLTeam's second generation of URL shortener archiving tools☆75Updated 2 weeks ago
- A Memento Aggregator CLI and Server in Go☆65Updated 3 months ago
- Serving content from a WARC☆61Updated 12 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- Reverse image search extension for Google Chrome.☆78Updated 9 months ago
- 💾 YouTube video metadata archiver written in Golang☆19Updated 5 years ago
- A social media open post web archiving tool☆27Updated 3 weeks ago
- 🖼 Image Extraction Tool☆16Updated 4 years ago
- Web archiving using Google Chrome☆44Updated 5 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- Converts WARC files to static HTML☆44Updated 11 months ago
- Archiving government FTPs.☆23Updated 8 years ago
- Uploads items into the Internet Archive after they have been downloaded with youtube-dl☆15Updated 10 years ago
- data with similar subreddits graph☆46Updated last year
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆179Updated 7 months ago
- A list of things related to software, literature, and other content for 🕣 Memento☆99Updated last year
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆54Updated 3 months ago
- Download okCupid users public data automatically☆10Updated 3 years ago
- Basic python script to list following and followed blogs on Tumblr☆20Updated 10 years ago
- simple script to convert web resources to a single warc file☆21Updated 2 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆13Updated 8 months ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆86Updated last year