bibanon / webcache-scraper
The Bibliotheca Anonoma's own Bing Cache and Google Cache scraper scripts. Unlike most of the other ones you've seen, these actually work.
☆28 · Updated 7 years ago
Alternatives and similar repositories for webcache-scraper:
Users who are interested in webcache-scraper are comparing it to the repositories listed below.
- 💾 YouTube video metadata archiver written in Golang ☆19 · Updated 5 years ago
- Recover lost websites from the Web Infrastructure ☆89 · Updated 4 years ago
- Grabbing all news. ☆62 · Updated 5 years ago
- 🖼 Image Extraction Tool ☆16 · Updated 4 years ago
- dosage is a comic strip downloader and archiver ☆51 · Updated 5 years ago
- Specialised bot for periodical grabs and video/audio/etc. webpage scrapes. ☆11 · Updated 7 years ago
- Some tools to help analyze the Twitter archive ☆62 · Updated 8 months ago
- JavaScript reincarnation of Google Cache Browser ☆49 · Updated 2 years ago
- Short script for removing watermarks from PDF files. Requires pdftk. ☆58 · Updated 6 years ago
- Tool and library for handling Web ARChive (WARC) files. ☆157 · Updated 6 months ago
- Phantombuster's SDK ☆14 · Updated 6 months ago
- Reverse image search extension for Google Chrome. ☆75 · Updated 7 months ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆46 · Updated 7 years ago
- Easily archive important Reddit post threads onto your computer ☆59 · Updated 2 years ago
- ☆33 · Updated 9 years ago
- A collection of tools to make it easier to generate wordlists and collect tripcodes ☆17 · Updated 7 years ago
- NSFW reverse image search for Reddit ☆63 · Updated 9 years ago
- ☆11 · Updated 3 years ago
- Official Privly Browser Extension for Google Chrome - Allows for Viewing and Posting Content on Any Website Without the Host Site Having … ☆32 · Updated 9 years ago
- Basic Python script to list following and followed blogs on Tumblr ☆19 · Updated 10 years ago
- Nondestructive warc-in-tar to warc conversion ☆26 · Updated 12 years ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head ☆170 · Updated 4 years ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io ☆38 · Updated 9 years ago
- Removes evil URL parameters such as Google Analytics' utm parameters ☆19 · Updated 7 years ago
- Media Bias Fact Check extension ☆39 · Updated this week
- A tool that helps with analysis of obfuscated JavaScript ☆11 · Updated last year
- Scraper for Facebook, Gab, Google and TikTok ☆21 · Updated 9 months ago
- Download porn videos (Python) ☆12 · Updated 7 years ago
- Source real estate prices from the Common Crawl. ☆27 · Updated 6 years ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.… ☆23 · Updated 4 years ago