bibanon / webcache-scraperLinks
The Bibliotheca Anonoma's own Bing Cache and Google Cache scraper scripts. Unlike most of the other ones you've seen, these actually work.
☆28Updated 7 years ago
Alternatives and similar repositories for webcache-scraper
Users that are interested in webcache-scraper are comparing it to the libraries listed below
Sorting:
- Recover lost websites from the Web Infrastructure☆89Updated 3 weeks ago
- Easily archive important Reddit post threads onto your computer☆59Updated 3 years ago
- Grabbing all news.☆62Updated 5 years ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.☆127Updated last week
- Archiving public telegram messages.☆13Updated this week
- A list of things related to software, literature, and other content for 🕣 Memento☆99Updated last year
- 💾 YouTube video metadata archiver written in Golang☆20Updated 5 years ago
- Bot for operating snscrape in #archivebot on efnet☆11Updated 5 years ago
- data with similar subreddits graph☆48Updated 2 years ago
- Extract user info from their reddit comments and activity.☆68Updated last year
- Converts HTTrack crawls to WARC files☆33Updated last year
- Reverse image search extension for Google Chrome.☆79Updated last year
- Wayback Machine Downloader. 🔥 Download your entire archived websites from the Internet Archive Wayback Machine.☆99Updated 3 years ago
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now☆132Updated 5 months ago
- ☆11Updated 3 years ago
- The reddit Data Extractor is a cross-platform GUI tool for downloading almost any content posted to reddit. Downloads from specific users…☆238Updated 8 months ago
- Distributed crawler, database and web frontend for public directories indexing☆142Updated 5 years ago
- A list of memex-related tools and their repository URLs☆152Updated 7 years ago
- Increment a URL or go to the next page. Supports auto, multi, and advanced incrementing functions.☆46Updated last month
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.☆116Updated last year
- The OpenWayback Development☆504Updated last year
- URLTeam's second generation of URL shortener archiving tools☆80Updated last month
- A tool that helps with analysis of obfuscated JavaScript☆11Updated last year
- 🖼 Image Extraction Tool☆18Updated 4 years ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆25Updated 4 years ago
- Image downloader for various imageboards and image albums written in Python.☆10Updated 5 years ago
- Generates new porn descriptions based on an edited dataset of xhamster video descriptions uploaded between 2007-2016.☆54Updated 8 months ago
- Enhance your basic booru experience☆12Updated 4 years ago
- The subreddit archiver☆179Updated last year
- Web archiving using Google Chrome☆47Updated 5 years ago