bibanon / webcache-scraper
The Bibliotheca Anonoma's own Bing Cache and Google Cache scraper scripts. Unlike most of the other ones you've seen, these actually work.
☆28 Updated 7 years ago
Alternatives and similar repositories for webcache-scraper
Users who are interested in webcache-scraper are comparing it to the repositories listed below.
- Recover lost websites from the Web Infrastructure ☆89 Updated 4 months ago
- URLTeam's second generation of URL shortener archiving tools ☆79 Updated 3 months ago
- A collection of tools for archiving and analysing the internet. ☆78 Updated 3 years ago
- Web Archiving Integration Layer: One-Click User Instigated Preservation ☆385 Updated 9 months ago
- Tool and library for handling Web ARChive (WARC) files. ☆165 Updated last year
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents) ☆167 Updated 3 months ago
- Uploads items into the Internet Archive after they have been downloaded with youtube-dl ☆15 Updated 10 years ago
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now ☆135 Updated 8 months ago
- A list of things related to software, literature, and other content for 🕣 Memento ☆102 Updated last year
- Grabbing all news. ☆62 Updated 5 years ago
- A tool that helps with analysis of obfuscated JavaScript ☆11 Updated 2 years ago
- Selective 4chan archive ☆37 Updated 4 years ago
- Easily archive important Reddit post threads onto your computer ☆58 Updated 3 years ago
- 💾 YouTube video metadata archiver written in Golang ☆21 Updated 5 years ago
- A collection of tools to make it easier to generate wordlists and collect tripcodes ☆18 Updated 8 years ago
- 🖼 Image Extraction Tool ☆18 Updated 4 years ago
- NOTE: This project is no longer being actively developed. Check out https://replayweb.page / https://github.com/webrecorder/replayweb.pa… ☆200 Updated 10 months ago
- ArchiveBot, an IRC bot for archiving websites ☆402 Updated 4 months ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service ☆188 Updated last year
- TorrentFinder.sh is a "simple" bash script that uses wget, grep, etc. to list the top 5 torrents found on each of the 8 sites based on th… ☆12 Updated 7 years ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication. ☆130 Updated 3 months ago
- Python library and command line tool for collecting JSON data from Gab.ai. Scrape posts, users and comments from "free-speech" social med… ☆38 Updated 3 years ago
- Awesome Bulletin Board/Forum List ☆20 Updated 7 years ago
- Web archiving using Google Chrome ☆46 Updated 5 years ago
- Archive a reddit user's post history. Formatted overview of a profile, JSON containing every post, and picture downloads. Uses the pushs… ☆52 Updated 3 years ago
- Archiving URLs (outlinks) from a variety of sources. ☆24 Updated last week
- The subreddit archiver ☆176 Updated 2 years ago
- Easily archive important Reddit post threads onto your computer ☆63 Updated 3 years ago
- Convert HTTP Archive (HAR) -> Web Archive (WARC) format ☆54 Updated 7 years ago
- data with similar subreddits graph ☆48 Updated 2 years ago