bibanon / webcache-scraperLinks
The Bibliotheca Anonoma's own Bing Cache and Google Cache scraper scripts. Unlike most of the other ones you've seen, these actually work.
☆28Updated 7 years ago
Alternatives and similar repositories for webcache-scraper
Users that are interested in webcache-scraper are comparing it to the libraries listed below
Sorting:
- Recover lost websites from the Web Infrastructure☆89Updated 4 years ago
- Grabbing all news.☆62Updated 5 years ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head☆170Updated 5 years ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆180Updated 9 months ago
- A list of things related to software, literature, and other content for 🕣 Memento☆99Updated last year
- A Memento Aggregator CLI and Server in Go☆65Updated 4 months ago
- FUPS: Forum user-post scraper☆21Updated 7 months ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆54Updated 3 weeks ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)☆163Updated last week
- WIP tag-based file organizer & search☆39Updated last year
- 💾 YouTube video metadata archiver written in Golang☆19Updated 5 years ago
- Easily archive important Reddit post threads onto your computer☆59Updated 2 years ago
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.☆116Updated last year
- Reverse image search extension for Google Chrome.☆78Updated 10 months ago
- The reddit Data Extractor is a cross-platform GUI tool for downloading almost any content posted to reddit. Downloads from specific users…☆238Updated 7 months ago
- An in-progress collection of sites that spread clickbait, hoaxes, propaganda and disinformation.☆100Updated last year
- Estimating the age of web resources☆96Updated last month
- Easily archive important Reddit post threads onto your computer☆62Updated 2 years ago
- Google Books Downloader / Image Scraper☆53Updated 6 years ago
- Script to import youtube-dl metadata to PostgreSQL☆14Updated 6 years ago
- Some tools to help analyze the twitter archive☆62Updated last month
- TikTok channel bulk ripper based on TikTok-Api and Youtube-dl. Some assembly may be required.☆36Updated 2 years ago
- dosage is a comic strip downloader and archiver☆51Updated 5 years ago
- Photoshopped and leaked nude photos of celebrities and normal people are circulated throughout the internet. This project is aimed to rep…☆42Updated 6 years ago
- Python suite for batch-downloading images from galleries☆78Updated 2 years ago
- URLTeam's second generation of URL shortener archiving tools☆77Updated last month
- A Chrome extension that creates a personalized map of the web based on the user's browsing history.☆24Updated 12 years ago
- Web archiving using Google Chrome☆47Updated 5 years ago
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now☆129Updated 3 months ago